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Preface 


In this second volume on the Foundations of Quantum Mechanics we shall 
show how it is possible, using the methodology presented in Volume I, to 
deduce some of the most important applications of quantum mechanics. 
These deductions are concerned with the structures of the microsystems 
rather than the technical details of the construction of preparation and 
registration devices. Accordingly, the only new axioms (relative to Volume I) 
which are introduced are concerned with the relationship between ensemble 
operators W, effect operators F, and certain construction principles of the 
preparation and registration devices. The applications described here are 
concerned with the measurement of atomic and molecular structure and of 
collision experiments. 

An additional and essential step towards a theoretical description of the 
preparation and registration procedures is carried out in Chapter XVII. 
Here we demonstrate how microscopic collision processes (that is, processes 
which can be described by quantum mechanics) can be used to obtain novel 
preparation and registration procedures if we take for granted the knowledge 
of only a few macroscopic preparation and registration procedures. By clever 
use of collision processes we are often able to obtain very precise results for 
the operators W and F which describe the total procedures from a very 
imprecise knowledge of the macroscopic parts of the preparation and regis¬ 
tration processes. In this regard experimental physicists have done brilliant 
work. In this sense Chapter XVII represents a general theoretical foundation 
for the procedures used by experimental physicists. 

Thus Chapters II to XVII represent a complete foundation of quantum 
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theory as far as experimental practice is concerned. Fundamental questions 
about the relationships between quantum mechanics and the objective 
description of macroscopic systems (that is, a “statistical mechanics” 
description), and the related problem of the completion of the measurement 
process in the macroscopic domain are not treated in this book. Readers 
will find a few critical comments and discussion of these problems in Chapter 
XVIII. A fundamental treatment of these problems can be found in [7]. 

I hope that this second volume, and Volume I, will contribute to the 
elimination of false problems in quantum mechanics. The two volumes 
present a formulation of quantum mechanics which is self-consistent and 
can describe all possible experiments in the application range of the theory. 
Whether this description can fulfill the conscious or unconscious ideological 
and aesthetic desires of all readers is another problem. 

References in the text are made as follows: For references to other sections 
of the same chapter, we shall only list the section number of the reference, 
for example, §3.1. For references to other chapters, the chapter is also given; 
for example, XVII, §2.3 refers to Chapter XVII, Section 2.3. The formulas are 
numbered as follows: (2.3.10) refers to the 10th formula in Section 2.3 of the 
current chapter. References to formulas in other chapters are given, for 
example, by XIV (2.3.10). References to the Appendix are given by AV, §2, 
where AV denotes Appendix V. 

Again, as in the case of Volume I, I wish to express my deep gratitude to 
Mr. Carl A. Hein who has undertaken the difficult task of translating these 
two volumes into English. The translation has been especially difficult 
because of the wide range of topics discussed in these two volumes, ranging 
from the coverage of various specialized areas in advanced mathematics, 
philosophical questions in quantum mechanics, and the discussion of 
theoretical and experimental aspects of quantum physics. I would also like 
to thank Mr. Hein for accommodating my wishes to make late alterations 
to the text which I hope will make the book more understandable to readers. 

Marburg, January 1984 G. Ludwig 
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CHAPTER IX 


Representation of Hilbert Spaces by 
Function Spaces 


The representation of Hilbert spaces in terms of function spaces is of great 
importance in the application of quantum mechanics. We have already 
encountered such representations in VII, §2 in the form SS 2 (R 3 , dk l dk 2 dk 3 ) 
and in VII, §3 in the form S£ 2 (R 3 , dx x dx 2 dx 3 ), where we have considered 
isomorphic maps of <£ 2 (R 3 , dx x dx 2 dx 3 ) into if 2 (R 3 , dk x dk 2 dk 3 ) which are 
defined by Fourier transforms. Such (and similar) representations are 
especially suitable for the treatment of physical problems. This chapter does 
not introduce any new axioms, that is, any new physical laws; instead, we 
introduce a number of suitably chosen tools for the analysis of the physical 
laws in the previous chapters. In this chapter we shall consider only a single 
Hilbert space jf, that is, a single-system type. 


1 Maximal Decision Observables 

The starting point for our discussion of particular representations of a 
Hilbert space is the discussion of measurement scales for observables 
presented in IV, §2.5, and, in particular, the discussion of “scale observables” 
as described in IV, D 2.5.6. 

We shall now assume that we are given a complete Boolean ring 
E = E(y 1; ..., >'„) of decision effects, the scale for which is uniquely deter¬ 
mined by the specification of n scale observables A u ... ,A n (which mutually 
commute, see the remarks preceding IV, D 2.5.6). In IV, §2.5 we have seen 
that, to each Boolean ring E of decision effects there exists a finite number of 

1 
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scale observables A u ..., A„ for which £ = £(y x , . • •, v„). Indeed, we may 
choose n = 1. 

Because we frequently find that certain A v (and the corresponding scales) 
are physically determined on the basis of transformation groups (see, for 
example, VII and VIII) we will therefore not assume that n = 1. Since it is 
not, however, difficult to extend the structures described for the case n = 1 
to the case n > 1, the following discussion will be more transparent if, in the 
following, we examine the case in which n = 1, and then carry out the exten¬ 
sion to n > 1 wherever necessary. 

A Boolean ring £ is said to be maximal, as a subset of G, if there does not 
exist a Boolean ring £ <= G for which £ |e £. A maximal £ is, on the basis of 
IV, Th. 1.4.7, complete. According to IV, Th. 1.3.6 £ is maximal if there exists 
no £ e G, £ £ £ which is commensurable with all elements of £. 

Th, 1.1. To each £ c: G there exists a maximal £ c Gfor which £ cz £. 

The proof of this result follows directly from Zorn’s lemma, because, for a 
totally ordered subset of Boolean rings £ A cz G for which £ cz £ A , the set 
(J A £ a is a Boolean ring. 

Since, according to IV, §2.5 each £ may be represented as £(y 1; ..., y„) 
we shall, for the purpose of discussion of the representations of Hilbert 
space, always use maximal Boolean rings £(y x ,..., y„) <= G. 

D 1.1. A maximal Boolean subring of G is said to be a maximal decision 
observable. A n-tuple of commuting A u ..., A n is said to be maximal if the 
Boolean ring £(y x ,..., y„) generated by this n-tuple is maximal. 

If {T a } is a set (not necessarily finite) of self-adjoint commuting operators 
and if £ is the Boolean ring generated by the spectral families of the A k then 
there exists a maximal £ for which £ c: £. According to IV, Th. 2.5.6, to £ 
there exists a y such that £ = £(y) and, according to IV, Th. 2.5.9 there 
exists a maximal scale observable A for which the corresponding Boolean 
ring is equal to £. Since £(y A ) <=£<=£, A } is, according to IV, Th. 2.5.11, 
a function / a (A). To each set {A k } of coexistent (and therefore commensur¬ 
able) scale observables there exists a scale observable A for which all the A x 
are functions of A. 

We shall now introduce the following notation: For the Boolean ring 
£ c G and <p e let ^~(£, <p) denote the (closed) subspace of Jf generated 
by {Ecp\E e £}; let £(£, <p) denote the projection operator corresponding 
to .3 r (£, (p). Since 1 e £ we find that cp e AT (£, (p). 

Th. 1.2. £(£, <p) is commensurable to all elements of £. 

Proof. For x e -TCL, <p) it follows that Ex e .T (£, <p) for all Eel., since this result is 

trivial for all x of the form E(p where Eel, since EE e 1. For ijj e Jf'' we find that 
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£(£, (p)if e -T(E, <p). Therefore, for £ e £ we obtain ££(£, ip)if e -TEL, <p), that is, 
£(£, <p)££(£, <p)if = EE(£, <p)\f. Since if was arbitrary it follows that 

£(£, <?>)££(£, <p) = ££(£, <p). 

From the adjoint equation we obtain £(£, <p)EE(£, q>) = £(£, <p)E and we have 
proven that £ and £(£, <p) commute. 

Th. 1.3. For £ there exists a (p e for which w = P v is effective on £, 
that is, for E e £ the following relationship is satisfied : 

tr (P^E) = (cp, Ecp) = <<p, E 2 cp) = ||£<p|| 2 = 0 => E = 0. 

// £ is maximal, then there exists a cp e for which ,3 r (£, (p) = 

Proof. If £ is not maximal, then, according to Th. 1.1 there exists a maximal £ such 
that £ e £. If, according to the second part of Th. 1.3, there exists a <p such that 
■T(L, <p) = X, then, if for £ e £ we obtain \\Eq>\\ = 0, then for if from the projection 
space of £ and for arbitrary £ 6 £ we obtain 

(<p, E(py = (Et//, E(p ) = (if, EEtpf = (if, EE<p ) = 0, 

that is, if is orthogonal to all {Ecp \ E e £} and therefore is orthogonal to 2T (£, (p) = M', 
from which it follows that if = 0 and £ = 0. 

Now we need only prove the second part of Th. 1.3. We shall assume that £ is 
maximal. Let {y v } denote a dense denumerable subset in Jf. We define 

*= V^( s -Xv). 

»= i 

Therefore {J%} is an increasing sequence of subspaces. Since x,, e (£, y v ) we find 
that Xv e for v < n. Thus it follows that \j„ = 

We shall now show that, to each there exists a if„ such that = -T(L, if n ). We 
shall prove this result by induction. 

For JXfj we need only choose if, = Xi- Therefore, according to the induction 
hypothesis ^_ 1 =^'(£, i f„-f). Therefore we find that = .T(L, ifn-,) v 
Xn)- Let £„ denote the projection operator on :tf n . 

We therefore obtain £„ = £(£, i) v £(£, ■/„). Since, according to Th. 1.2, 
£(£, if n - [) and £(£, y„) are commensurable with all £ 6 £, then, according to IV, 
Th. 1.3.5, £„ is commensurable with all £ e £. Since £ is maximal, we therefore find 
that £„ e £ and £(£, x„) e £. 

We now choose if„ as follows: 

'f„ = >f„-i + (1 - 
By multiplication with an £ e £ it follows that 


Eif„ = Eif„- 1 + £(1 - £„-,)x„- 

Since £„_ , e £ and therefore 1 — £„_, e £ we find that £(1 - £„_ 0 e £ and we 
obtain £(1 — E n -f)x„e x„). Thus it follows that Eif„e and therefore 

^-(£, ^„) cz 

Since E n ^ l Eif„ = E„~ 1 Eif„~ l = EE nl if n -, = Eif n _ 1 we obtain Eif„_, e 
■^(£, if„) and -3^-j = .r(£, (>„_,) c ^(2, if*)- Since (1 - E„- l )Eif„ = 
E(i - £„-i)i/'„-i + £(1 - £„ i)x„ = £(1 - E„ .i)x„ = (1 - £„-i)£x„ (because 
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E n -gf„- i = 'l'n-i 0 we obtain (1 - £„_ y ).T(L, yj = 3T(L, y n ) n , <= 
S'fL, if„). Since £(Z, y„) and £„_ t are commensurable, 

^ = '^-1 V .T(X, Xn) = 3^-1 v [.r(X, *„) n 

and we therefore obtain TE n <=. -T(L, iff). Thus we have proven that = T’fE, iff). 
If TC„ — T? from a given value of n, then the theorem is proven. Since \J „TC n = TC. 
E n f-*f for each / e SC. If we set tu = if u ...,ri„ = (E„- £„_ fif n then ST (Z, r\„) = 
TC n n SCf- j. For a„ 0 and ||a„|| 2 < oo we find that 


and, since (£„ — £„_ f)(p = a„(r]J\\rj„\\) it follows that -r\„ e T(L, (p) and we therefore 
obtain Tf = \J n .T(L, r \ n ) <= ,T(L, <p), that is, -T(L, </>) = Tf. 

In the next section we shall obtain the following result as a corollary: 

Th. 1.4. If \\Eif\\ 2 is an effective measure over a complete Boolean ring 
XcG and there exists a cp e Jfffor which -TfL, <p) = TT, then S~(L, 4 1 ) = ST. 

From this result it follows that 

Th. 1.5. If Z is complete, and if there exists a <p e ST for which S' (Z, <p) = ST, 
then to each if s ST there corresponds a unique E 0 e Z for which E 0 ij/ = 0 
and Eif / 0 for all E e Z for which E < 1 — E 0 . In addition, S(L, if) = 
(1 — E 0 )ST, that is, E( Z, if) = 1 — E 0 e Z. 

Proof. We define E 0 as the union of all E e Z for which Eif = 0; since Z is complete 
E 0 e Z; therefore E 0 is the largest £ 6 Z for which Eip = 0. In the subspace ,tT 0 = 
(1 — E 0 )ST we find that Z 0 = {£(1 — £ 0 )|£ e Z} and q> 0 = (1 — E 0 )(p satisfies all 
assumptions of Th. 1.4. Therefore if = (1 — E 0 )>p is a vector from ST 0 for which 
■T(L 0 , i f) — ST 0 , and we therefore find that 

5-(Z, I/O = sr 0 = (l - £o)jr, 

that is, the projection £(Z, ip) which corresponds to .TEL, if ) is an element of Z. 

Th. 1.6. If Z is complete, and there exists a <p e ST for which S'fL, cp) = jf, 
then Z is maximal. 

Proof. Suppose that Z is not maximal. Then there exists a E T 0 which is commen¬ 
surable with all £ e Z. For if e ETC it follows that -T(L, if) <= ESP and £(Z, ip) < E. 
Since if e T(L, if) it follows that Vv>s ejt S'EL, if) = ESP, that is, E( Z, if) = £. 

Since, according to Th. 1.5, £(Z, if ) e L and since Z is complete, then £ s Z in con¬ 
tradiction to the assumption that E Z. 

Th. 1.7. For a complete Z the following two conditions are equivalent : 

(i) Z is maximal. 

(ii) There exists a (p e ST such that S' (Z, cp) = ST. 

The proof follows directly from Th. 1.3 and Th. 1.6. 
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Th. 1.8. //Z is complete, but not maximal, then there exists at most a de¬ 
numerable set of pairwise orthogonal E n e G which are commensurable to 
all elements of E; the Boolean ring I generated by E and by the E„ is maximal. 

Proof. Since the E n are pairwise orthogonal, there can be at most denumerably many 
£„. We now consider sets fpf of vectors for which the ST PL, <p v ) are pairwise ortho¬ 
gonal. Among these sets there are maximal sets, a result which follows directly from 
Zorn’s lemma. Let {<p„} denote such a maximal set. Let = -T(L, (p„) and E n = 
E(L, cp„): let £ denote the closed Boolean ring generated by E and the £„. 

If £ was not maximal, then there exists a £ 0 <£ £ which is commensurable with all 
£ e £. Thus we obtain 


£ ”=? £ a + 4-?4 

where £ 0 £„ = £„£ 0 . 

Since 2T n — 5"(E, <jo„) = ^"(E, £„<p„), the Boolean ring E„ = {££„)£ e E} is, 
according to Th. 1.7, maximal with respect to the Hilbert space .T n \ therefore 
£ 0 £„ e E„ <= £. Therefore £ 0 £„ e £. 

If 1 — £» ^ 0, then there would exist a x where x = (1 — E n )’/- Since 

5-(E, Z ) = r(E, (1 - E n )x) = (1 - L. E n )!T{L, x) it follows that -T(L, x ) is 
J. to all -T„, in contradiction to the fact that the {(p n } are maximal. Therefore £ 0 = 
£ 0 £„ e £ and £ is therefore maximal. 

Th. 1.9. The E n in Th. 1.8 can be so chosen such that from EE n = 0 for an 
E e E if follows that EE m = 0 for all m > n. 

Proof. We shall begin by using the £„ in Th. 1.8, and defining the £„ for which Th. 1.9 
is satisfied. With £ obtained from the proof of Th. 1.8, according to Th. 1.3 there 
exists a (p for which ,T(L, <p) = Since E is complete, the support (IV, Th. 2.1.5) 
of the measure |(££„q>|| 2 = p„(E) is defined in E and will be denoted by £ (n| . We 
therefore obtain E in) e E. We therefore obtain (1 — £ (n) )£„ = 0, that is, £„£ <n) = £„. 
Furthermore, from ££„ = 0 for £ 6 E it follows that £ < 1 — £ (B) , that is, ££ (n) = 0. 
We recursively define: 

£ur = £<0 

£(2)' = £(2) _ £<2)(1 _ £( D ) = £(2)£(1) ; 

£(3)' = £(3) _ £(3) _ £<1) v £(2)^ = £(3)(£(D v £< 2 >), 

£(4)' = £(4) _ £(4)(J _ £(1) v £(2) y £(3 )j = £(4)(£(1) y £(2) y £(3)^ 


and we then define 


£(2)" = £(2 )' f 


£(3)" _ £(3)'£(2)'^ 

£(4)" = £<4>'(£(2>' v £(3)'^ 
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and, in the same way, we obtain £ <3) '”, E w ", etc. 

The set of all projection operators obtained in the above manner are elements of 
£. With the aid of the above, we recursively define the £„: 

E, = E t E a) + £ 2 [£ (2, ( 1 - £ (1 >)] + £ 3 [£ (3) (1 - £ (1) v £ (2) )] 

+ £ 4 [£ (4) (1 - £ (1) v £ <2) v £ (3) ) 4- • ■ • , 

£ 2 = £ 2 £ (2) ' + £ 3 [£ (3, '(1 - £ (2> ')] + £ 4 [£ (4) (1 - £ (2) ' v £ (3) ')] + • • •, 

£ 3 = £ 3 £ (3) ” + £ 4 [£ (4) "(1 - £ <3> ")] + • • •, 


From£„£ <B) = £„and£„£ m = Oforn # m it follows that £„ are pairwise orthogonal. 
We must show that the complete Boolean ring generated by the £„ and £ is maximal 
and, from ££„ = 0 for a £ e E we also obtain EE m = 0 for all m> n. 

Let ££„ = 0, for example, ££ 2 = 0, then it follows that £ 2 ££ (2) '=0, 
£ 3 £[£ <3) (1 — £ <2) )] = 0,... . Thus it follows that ££ <2> < 1 — £ <2) and that with 
£< 2 >' = £ (2) £ (1) the relation EE ay = 0 and then we obtain the relation £ 3 ££ <3) = 0, 
that is, ££ (3) ' < 1 — £ 3 and we therefore obtain ££ (3) = 0. 

Similarly, it follows that ££ (4) ' = 0, etc. Thus it follows that ££ (2) = 0, ££ (3) = 0, 
££ (4) " = 0,... and we finally obtain ££ 3 = 0. Similarly, from ££„ = 0 it follows 
that ££„+! = 0 and therefore ££ m = 0 for all m> n. 

From the definition of £„ it follows that 

£]£ (1) = E t E (1) = £ lt 

£,£ (2) = £j£ (2> + £ 2 (1 - £ (1) ), 

£]£ (3) = £j£ (3) + £ 2 (£ <3) - £ (1) £ (3) ) + £ 3 (1 - E w v £ <2) ), 


and we obtain 


£1 = £1 £ (I> , 

£ 2 (l - £ (1> ) = £!£ (2) - E x E w E m = E^ 2 \\ - £ (1 >), 
£ 3 (1 - £ (1) v E m ) = £!£ <3) - E l E a) E {3) - £ 1 £ (2) £ <3) ( 1 - £ (1) ), 


Similarly, we proceed with £ 2 , £ 3) ... and recognize that the £ lt £ 2 ,... are 
elements of the Boolean ring generated by E and the £ x , £ 2 . Thus the complete 
Boolean ring generated by E and the £„ is equal to that generated by £ and the £„, 
and is therefore also maximal. 


2 Representation of as if 2 (Sp(/l), \i) where Sp(/1) is the 
Spectrum of a Scale Observable A 

The fact that every complete Boolean ring E c G can be generated by a 
scale observable A was explained in IV, §2.5 and emphasized in §1. In §1 we 
have also stressed the fact that these considerations can also be applied to the 
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case in which £ is generated by an n-tuple of commuting scale observables 
A 1 ,..., A n . Since we cannot use Th. 1.4-1.8, we shall, for the present, only 
assume that there exists a cp e for which J"(£, q>) = Jf. Then, for each 
f e .W there exists finitely many £ f e £ and complex numbers a t such that, 
given an e > 0 

n 

f - z ciiEiCp < s 

i= 1 

is satisfied. We may assume that the £,■ are pairwise orthogonal, because we 
may use, instead of the £ f , the pairwise orthogonal atoms of the finite 
Boolean subring of £ generated by the E t . 

According to IV, Th. 2.5.8 we may identify each E e £ with a measurable 
subset k of the spectrum Sp(A) of a scale observable A. Let P denote the set 
of these measurable subsets k. k is uniquely defined by £ up to a set of measure 
0. For finitely many orthogonal E t , since E t A Ej = E t Ej = 0 for i y j, we 
may choose the corresponding subsets fc ; such that fc ; n kj = 0 for i y j. 
Instead of E t we shall write £(/c ; ) (where £(/c) is defined in the sense of a(k) 
in IV, Th. 2.5.8); therefore, to each/ e and to each e > 0 there are finitely 
many disjoint measurable subsets k t of Sp(A) for which 

/- t^Eikd <a (2.1) 

;= i 

For a decomposition Sp(A) = (J" = x k t (k t n kj = 0 for i # j) of the spec¬ 
trum of A into measurable subsets /c, we define a stepwise function g on 
Sp(A) as follows 

g(a.) = y i fora ek h (2.2) 

where the y t are complex numbers. From the step function g(a) we obtain 
a vector from as follows: 

gf(a)-» X 'h E ( k i) ( P- (2.3) 

i= 1 

For a pair of decompositions {fcj and {&'•} it follows that 

(I -6WM I yiEikdv) = I -mwEik-mM 2 - 

' j i / i,j 

Since £(fc))£(fc ; ) = £(&} n k t ), and from the rr-additive measure over P 
given by 

/#) = \\E(k)(p\\ 2 , (2.4) 

we obtain 

\E y'j E ( k 'j)(p, E yt E ( k i)(p / = Z ^ k d- (2.5) 

\ j i / i,j 
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For g defined by (2.4) the set of step functions is dense in the Hilbert space 
J? 2 (Sp(A), n). From (2.3) and (2.5) it follows that (2.3) is an isomorphic 
mapping of this subset into which has a unique extension as an iso¬ 
morphism 

i? 2 (SpG4), ft) 


■ Jf. 


( 2 . 6 ) 


We write this isomorphism (2.6) in the following form (where/ (A) is measur¬ 
able and quadratically integrable): 


/(a) -» / = J/(a) dE(a)(p, 


(2.7) 


where the integral on the right-hand side can be considered to be defined 
by this isomorphism. Thus it follows that, for a sequence of step functions 
g v (y) for which gr v (a) —> /(a) (in the sense of the norm in 2 (Sp04), /*), 
g v = Yji y[ v> £(fc| v) ) is a Cauchy sequence in M' (where the y[ v) are the values 
corresponding to the g v according to (2.2)) which satisfies 


0v -► J/(<*) dE(a)(p. 


We can also choose to use (2.8) as the definition of the integral. 
Here we remark that 


I 


/ (a) dE( a) 


( 2 . 8 ) 


(2.9) 


defines a function of the operator A for measurable /(a); /(a) need not be 
quadratically integrable. The operator B = f(A) is a “normal” operator; its 
definition domain is the set of all i/r e for which J |/(a)| 2 d|j£(a)i^|| 2 < oo. 
If f(a) is bounded, then so is f(A), where the latter is already defined in 
IV, §2.5 (see IV, Th. 2.5.11). 

To each operator B in there exists a corresponding operator B in 
i? 2 (Sp(^), g) defined by the isomorphism (2.6), (2.7) (see AIV, §13) as 
follows: 

/'(a) = Bf( a) -> Bf - j/'(a) dE(y)cp. 

The mode of action of 

A = |a dE(y) (2.10) 

is particularly simple. If we replace the integral in (2.10) by the approximation 
sum 

X «;(£(<*,•) ~ £(<*(- i)X 

i 

then it is easy to show that 


Af(<x) = a/(a). 


( 2 . 11 ) 


Piifie. 7lltt£fccj<t giumf ?fcy 4iai 
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Similarly, it is easy to prove that for a measurable set k e P 

s f/O) forae/c, 

«« = {o for atk. <U2) 

Let iA be a vector in and let i//( a) be the corresponding function in 
«Sf 2 (Sp(A), p). Then, for a finite sum it follows that 

X y i E(k i )<p(ix) -> X 

i i 

and for p(x) defined according to (2.2) we obtain 

<y(a)t/>(a) -> X iMk^. 

i 

The subspace .^~(E, iA) of therefore corresponds the subspace of 
if 2 (Sp(A), p) spanned by all p(a)iA(a) where p(a) are arbitrary step functions; 
let 5~(E, iA) denote this subspace. Thus we find that 5~(E, ijj) is spanned by 
all h(a.)il/(tx) where the h(a) are bounded measurable functions. 

Let ky denote the measurable set 

= {a | a e Sp(^4) and iA(a) # 0}. (2.13) 

Then we obtain 

•^(Z, </0 = {/(«) I /(«) e if 2 (Sp(^), p) and/(a) = 0 for a e SpG4)\/c^} 
and we therefore obtain 


^-(E, >A) = £(^)Jf. 

According to (2.13), (1 — £(fc<*,))<A(a) = 0, and we therefore find that 

(1 - E(k^M = 0. (2.14) 

If the measure ||£iA|| 2 is effective over E then we must have 1 — E(k llf ) = 0 
and we obtain E(k^,)Jt = Jf, that is, ,T (E. \j/) = from which we have 
also proven Th. 1.5 and 1.8. In particular, the operator A we used at the 
beginning must have been a maximal scale observable. 

If iA is, in addition to (p , a second vector for which -T(E, iA) = Jf then with 
the help of (A we may define an isomorphism in the same way as we have 
done with <p as follows: 


^ 2 (Sp(A), p) -> jf, 

/(<*) ->• / = J /(a) d£(a)iA, 

where 

p(/c) = ||£(/c)iAI| 2 = f I *A( a ) 1 2 dp(y.). 


(2.15) 


(2.16) 


Piifie. 7lltt£fccj<t giumf Pfcy iiai 
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For a step function y(y.) of the form (2.2) where we replace y ; by y t it follows 
that 

0(a) -»■ E E 

i i 

where £(/c)t^(a) is given in (2.12). Therefore 

0(a) -> E y. £ ( fc ;) l A ♦- 0(a)iA(a). 

i 

This result can be extended to all /(a) e =Sf 2 (SpG4), y), that is, the iso¬ 
morphisms (2.6), (2.7) and (2.13) lead to an isomorphism 

-^ 2 (SpC4), y) -*■ if 2 (Sp(^4), y), 

/(a) -* /(a) = / : (o#(a). 

We call «S? 2 (Sp(A), /t) together with the isomorphism (2.6) an ^-represen¬ 
tation of the Hilbert space (obtained by means of (p). In an ^-representa¬ 
tion the special simple relations (2.11) and (2.12) hold. Between the 
^-representations obtained by means of (p and i ji the relationship (2.17) is 
satisfied, where i/r(oc) is an arbitrary measurable function which is 
quadratically integrable with respect to y and is nonzero almost everywhere 
on SpG4). According to (2.7) it is clear that in the representation obtained 
by means of <p the vector (p corresponds to the function (p(a) = 1. 

In an /I-representation the probability tr (P f E) may be simply calculated 
as follows for E e Z: To E there exists a measurable set k for which E = E(k). 
Then, according to (2.12) we obtain 

tr (p f E) = </, E(k)fy = jmmm dm 

= f \f(a)\ 2 dy(a), (2.18) 

•Ik 

where (2.18) is the probability that in the measurement of A for an ensemble 
W = P f the scale value of A lies in the subset k. 

Is it possible to extend the formula (2.18) to general W ? For W= E» w v P<i> v , 
w v > 0 and E w v = 1 it follows that tr(WE(k)) = Ev w v tr (P^ v E(k)) and we 
therefore obtain 

tv(WE(k)) = E w v f I<Av(a)I 2 dy{ a). (2.19) 

V J 

What is the form of the operator W in if 2 (Sp {A), y)l For 
Pffi. a) = 'A(a) jV(a')/(a') dy{a') 

it follows that 

Wf{ a) = E w> v (a) f iA v (ot')/(a') dy(cc'). (2.20) 


Piifie. 7lltt£fccj<t giumf Pfcy iiai 
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Since £ v I w v l = Xv w v = 1 there exists a function W( a, a') in Sp(/1) 2 which 
is, for fixed a, measurable in a' where W(ol', a) = fF(a, a') and W(oc, a) > 0, 
so that it is possible to write (2.20) in the form 

/'(a) = F/(a) = | W( a, a')/(a') (2.21) 

(2.19) is transformed into 

tr (WE(k)) = f W( a, a) dn(<j.). (2.22) 

Jk 

From tr(W0 = 1 it follows from (2.22) that, for the special case in which 
k = Sp(A), 

| W(<x, a) dfi(a) = 1. (2.23) 

The representation of W by means of an integral kernel W(tx, a') cannot be 
extended to the case of arbitrary self-adjoint operators B, not even, for 
example, to A in (2.11). In order to overcome these computational difficulties 
we shall present new methods in §§3-5. 

The simple form (2.11) of A permits us to define arbitrary measurable 
functions /(A) as follows: If % is an arbitrary real ^-measurable function on 
SpG4), then we define 

x(A)f(cc ) = *(a )/(a). (2.24) 

%(A) is therefore a self-adjoint operator with domain of definition of all the 
/ (a) for which 

Jlx(a)/(a)| 2 dn(a) < oo. 

This represents a generalization of the function concept introduced in 
IV, §2.5 to the case of unbounded functions. 

In particular, A itself can be an unbounded self-adjoint operator. Then 
Sp(^4) will be an unbounded subset on the real axis. The domain of definition 
of A is then the set of all/ (a) e if 2 (Sp(^), n) for which a/(a) e if 2 (Sp(^4), fi). 

In the ^-representation described above it is customary to define an 
“renormalization” in which the spectrum of A is decomposed into three 
disjoint sets k d , k cc , k sc as described in IV, §2.5 (page 147-148): Sp(^) = 
k d u k cc u k sc . Accordingly fj, is decomposed into /r = fi d + n cc + /r sc where 
nAh) = A (k d n k), fi jk) = jx(k cc n k) and n sc (k) = n(k sc n k). The set k d is 
identical with the set of discontinuity points of the spectral family E(a) of 
A. The set k d is denumerable; let a v denote the points of k d . Let P v = (£(a v ) — 
E( a v —), then we find that E(k d ) = Y, V P V . The projections E(k d ), E(k cc ) and 
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E(k sc ) are pairwise orthogonal, and 1 = E(k d ) + E(k cc ) + E(k sc ). For f d = 
E(k d )f, f cc = E(k cc )f, f sc = E(k sc )f the right-hand side of (2.7) takes on the 
form/ = ft + f cc + f sc where 

/d = Z /(«»)*%«», 


fee = f /(a) dE(<x)cp, 
Jk c 

fsc = f /(a) dE{rj.)<p. 

Jksc. 


(2.25) 


From = ST (Z, cp) it follows that all the P v are one-dimensional projec¬ 
tions, because from (2.25) it follows that, for a vector / satisfying P v f = / 

/ =/(oc v )P v <p, 

that is, all vectors from the projection space of P v are multiples of a single 
vector P v <p. 

The set of solutions of the eigenvalue equation Af = a„/ is given by the 
set of vectors in the projection space of P v . Af = of has a nonzero solution 
only for the values of a. v ek d . If the eigenspace for the eigenvalue a v of A is 
one-dimensional, then this eigenvalue is said to be nondegenerate. From 
.Af = .TfL, (p) it follows that the eigenvalues a v are nondegenerate. 

For <p v = P v (p it follows from (2.25) that 

/=I/(<O0v +f cc +fse. (2.26) 


Since the <p v are pairwise orthogonal and are orthogonal to f cc ,f sc from (2.26) 
it follows that 


<0v,/> =/(a v )ll^vll 2 


and that 


/=E 


<?v 


<Pv 


Il0vll Wvll 


I / / + fc + fsc 


(2.27) 


Here ||<p v || 2 is the measure fi(k v ) where k v consists of the single point a v . 
Equation (2.27) suggests the following renormalization: Instead of using the 
Cp v in the expansion (2.27) we use the normalized eigenvectors <p v = ip v || <p v || “ 1 . 
Then we obtain 


/ — Z < Pv< ( Pv>/) + fee + fsc- 
V 

We now define 

<<Pv,/> = <<* v |/>, 


(2.28) 

(2.29) 


where we consider <oc v |/> to be a complex function defined on k d . We 
obtain 


<«vl/>=/(OV®, (2.30) 
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where fc v is the set consisting of a single point a v . By analogy with (2.30), for 
a 6 k cc we define 

<«!/> =/(«X/pW, (2.31) 

where p( a) is defined over k cc by the equation 

p(k) = p{a) dot for all k c k cc , (2.32) 

Jk 

where dot. is the Lebesgue measure. For a. e k sc we set <a|/> = /(a). In this 
way we obtain a new isomorphism in addition to that given by (2.6) and 
(2.7) as follows: 

^ 2 (Sp (A), p n ) - .JT, (2.33) 

<a|/>-/, 

with the “normed” measure 

yu„(/c) = N(fc n k d ) + f doc + f dp(ot), (2.34) 

Jknk cc Jknksc 

where N(k n k d ) is the number of elements of k n k d . p n is not necessarily 
bounded, that is, p n (Sp(A)) = oo is possible! 

If k sc = 0, then (2.18) takes on the renormalized form: 

tr(P / £(/c))= I K « v |/>| 2 + f \<a\f>\ 2 dtx. (2.35) 

v <Jk 

<x v eknkd 

The existence of a set k sc ^ 0 appears to be peculiar for a physical measure¬ 
ment scale. From the viewpoint of the mathematical arbitrariness of the 
choice of a measurement scale as described in IV, §2.5 we cannot produce an 
argument against the existence of the set k sc . If, in the choice of scales we 
introduce an additional structure to that described in IV, §2.5, we introduce 
new questions about the sets k d , k cc , k sc \ for example, with respect to the 
Hamiltonian operator H as a scale observable, where H describes the time 
translation of the registration apparatus relative to the preparation apparatus 
On the basis of this physical meaning for H the “measurement scale” for H 
attains a new physical meaning (see, for example, X, §1 or XVI, §4.3). 


3 Improper Scalar and Vector Functions Defined on Sp(yl) 

In practice it is often desirable to use different representations of a Hilbert 
space for quantum mechanics. In many respects the distinction between the 
sets k d and k cc and, if it exists k sc , is inconvenient. For this reason methods 
have been developed which permit the points of k cc to be treated as “simply” 
as are those of k d . In this way we also seek to obtain “solutions” / of the 
eigenvalue equation Af = af for points of k c . These / clearly cannot be 
elements of 
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One of these methods is that of the Gelfand triple [1]. A second method 
is that developed by Nikodym [2], In either case there is no “simple” 
accessible method which is sufficiently reliable to permit us to attempt to 
guarantee (on the basis of general theorems) the mathematical validity of 
the computed results. For the most part these methods are used as heuristic 
principles, to obtain quick results, the mathematical validity of which can 
later be verified in individual cases. In concrete examples it is not unusual 
to find unsatisfactory results which do not, however, place the theory itself 
in question. Perhaps, in quantum mechanics, a restriction to the mathematics 
of Hilbert space is inconvenient. Such a restriction is not necessary since 
operators in Hilbert space are only representations of a special Banach space 
structure (see III, §3). In this representation there is an additional underlying 
structure which we have denoted by 3 (see the discussions in VII, VIII). It 
is conceivable that the development of a “theory of 3” could lead to new 
practical methods (see also [3]). 

Since we do not yet have a satisfactory method for the reformulation of 
mathematical problems in 3C we shall now present a brief outline of the 
method of Nikodym in order that we may at least have a general method 
which underlies the somewhat messy mathematical calculations which will 
appear later in this book. 

In order to obtain a greater similarity between the general expansion of 
/ on the right-hand side of (2.7) and the expansion of / with respect to a 
complete orthonormal basis <p v (the case in which Sp(A) = k d ), f = 
£v <p v <<Pv> />» we sha H return to (2.1). 

According to IV, Th. 2.5.8 (and the remarks preceding the theorem), for 
ke P we obtain 

00 

E(k) = A V E(h), (3-1) 

U V = 1 


where /\ u is to be taken over all coverings u of k. For each (p e \/ v A, £(/„)<p 
may be approximated in the norm to arbitrary accuracy by \/v'=i £(/ v )<p. 
Here we may assume that the intervals / v are mutually exclusive because we 
may easily replace V^=i by VJT w ' l h disjoint l\ L . From (3.1) it 
follows that E(k)(p can be approximated arbitrarily well by V“=i £(/ v )<p for 
a particular covering u. Therefore, E(k)<p can be arbitrarily well approximated 
by Vv= i E(Iv)<P where the intervals 7 V are disjoint, that is, it can be ap¬ 
proximated by the sum £?= j E(I v )(p. 

From (2.1) it follows that each / can be approximated by finite sums 
Y, p a P E(I p )(p where the I p are disjoint intervals. Therefore, to each e > 0 
there exists a partition of R into finitely many disjoint intervals I p and there 
exists a finite set of complex numbers a p such that 


/-Z«pW> 

p 


< £. 


Let I" be, for fixed n, a partition of R into intervals where d n is the maximum 
length of the intervals and suppose S n -> 0. Then there exists for each e > 0 


7lltt£fccj<t giumf Pfcy iioi 
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a 5 n and a set of numbers a" such that 


/-£<£(/> 

v 


< £. 


Since (for fixed n ) the £(/") are pairwise orthogonal, \\f — a"£(/")<p|| will 

take on its minimum value (see AIV, §2) when 


Therefore we obtain 


<£(/>,/) 
l|£(/>ll 2 ' 


lim X 


<£(/>,/) 

l|£(/>ll 2 


= / 


(3.2) 


for each / e Jf. For the case in which A has only a discrete spectrum, with 
a v denoting the elements of k d and £(a v ) - £(a v —) = P v (see (2.27)) we 
obtain 




<Fy<?,/> 


= /• 


(3.3) 


For (p v = P v <p||P v <p|| (3.3) is an expansion of / with respect to the normed 
eigenvectors <p v of A. We may write <p v as the limit as a' < a v as follows: 


(£(q v ) - E(y.'))<p = P v <? 

JZ ll(£(«v) - £(«'))<PII IIPv^ll 


(3.4) 


In general, if a v is not a point of discontinuity of the spectral family £(a) then 
the limit of 


= (£(a) - £(«'))<? 
||(£(a) — £(a'))<pj| 


(3.5) 


does not exist for a'-» a, otherwise linv_ a %**• would be a normalized 
eigenvectoY of A with eigenvalue a. 

If a! is close to a then will approximately represent an eigenvector of 
A since 

< tx - a'. (3.6) 

We obtain (3.6) from 


= ||(£(a) - 


EWto ir 1 f 

J<x' 


(a" — a) dE{a!')(p 


and from 


MXca' - 0%JI 2 = II (£(<*) - E(x’))(p\\ 2 f (a" - a) 2 d||£(a'>|| 2 

* a.' 

< ||(£(a) - £(«'))<?> rV - a) 2 ||(£(a) - £(a'))«p|| 2 
= (a' — a) 2 . 





16 IX Representation of Hilbert Spaces by Function Spaces 


Equation (3.2) says that, for sufficiently large n each / can be expressed to 
arbitrary accuracy by an expansion in terms of “approximate” eigenvectors 
% m ' of the form (3.5). This result motivates the following definition of an 
“improper” vector function 2(a) on Sp(A). 

For fixed a, 2(a) is a mapping of the interval — oo < a' < a into that 
is, there exists a function x(a, a') which is defined for all (a, a') for which 
a' < a where x(a, a') e Jf; 2(a) is defined as the function /(%, a') as a function 
of a' for fixed a. 

Similarly, an improper complex function a(a) on Sp(A) is defined with the 
aid of a complex function a(a, a'). 

For/ e Jf, (x(a, a '),/> = a(a, a') defines, for an improper vector function 
2(a), an improper complex function 5(a). Flere we write a( a) = <2(a) ,/). 

We may also define improper functions for the case of several variables. 
For example, we may define an improper vector function x(xi, a 2 ) by means 
of function x(oc ,, a); a 2 , a 2 ) for a) < a u a 2 < a 2 where x( a i, aj; a 2 , rj 'i ) e 

An improper function fj(a 1 ,a. 2 ) is defined by ^(oq, a) ; a 2 , a 2 ) = 
a(oq, aV)/(a 2 , a 2 ); here we write fj( a ls a 2 ) = 5(a 1 )2(a 2 ). Similarly, we define 
a(ai, a 2 ) = <2(a t ), Zfe)) by <z(a l5 a)), /(a 2 , a 2 )>; similarly, we may define 
K a i) + z( a 2 )> etc.. Note, however, that we can define an improper function 
of only one variable a(u)x(oi) by a{ a, a')y(a, ot'). Then 

a(a)Ha) = 

does not hold! For an improper function fj{u u a 2 ) of two variables we cannot 
simply set a t = a 2 instead, we must write [? 7 (a 2 , a 2 )] ai =ct2 . Here we hope that 
no misunderstanding will result from the following formulae. 

Let k be a measurable set of the spectrum; let us consider, as in the case of 
IV, §2.5, a covering u of k by means of countably many intervals I v :k c= (J v / v . 
From such a covering we may select a covering consisting of disjoint intervals 
as follows: Choose I n = I 2 + I x is then a set which consists of finitely 
many disjoint intervals, so that J 2 u I % may be represented as the union of 
the disjoint intervals 7 X j and those in I 2 + /,. If (J™ = { may be represented 
as the union of disjoint intervals, then the same is true for I m+1 + (J”=i 
In this way we recursively prove our assertion. We need only consider the 
coverings consisting of disjoint intervals instead of all possible coverings. 
Since k is measurable, there exist sequences of intervals /" such that E n = 
£(/") is a projection operator and E n -> E(k). 

We could also assume that the maximum length S n of the intervals I" tend 
towards zero; to do so requires only that we divide the intervals I". 

An improper vector function 2(a) is said to be summable over Sp(A) if for 
every measurable subset k of Sp(A) the limit %(/") exists and is independent 
of the sequence of coverings u n = 1J V /" of disjoint intervals for which 
d n -» 0; for I : a' < ■ ■ ■ < a we have written /(/) instead of ^(a, a'). 

For a summable vector function 2(a) we write, for the definition 

f 2(«) = limZz«). (3.7) 

Jk n~* oo v 


Pu^ie. 
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We note that, according to the definition (3.7), only the points a e Sp(^4) 
occur in the sum J, since k must be a subset of Sp04). If the spectrum is 
discrete, that is, Sp(T) = k d , it follows that for a summable function 2(a) the 
limit lim e ^ 0 /(a v , a v — e) = % v e exists for the points a v e Sp(^4), where 
the latter follows from the fact that (3.7) is satisfied for k = {a v }, that is, for 
a set consisting of a single point a v ; from (3.7) it follows that for an arbitrary k: 

|*2(a) = I Zv (3.8) 

•'k a v ek 

If E(I)x(I) = %(I), then, from (3.7) it follows that for 

9 = f 2(a): f 2(a) = E(k)g. (3.9) 

Jsp (A) Jk 

The improper vector function 2(a) defined by /(/) = E(I)f for fixed 
/ e is summable, and trivially satisfies the equation E(I)x(I) = x(0- We 
obtain: 



J^2(a) = E(k)f. 


In this case j Sp(/)) 2(a) is nothing other than an alternative notation for 


dE(a)f. 


Two improper vector functions 2(a) and fj(a) are said to be equivalent 
(written 2(a) ~ fj(oi)) if, for all measurable k, the following equation is 
satisfied: 



(3.10) 


From the above 4erivation it follows that, for example, the improper vector 
function 2(a), defined in terms of £(/)/, is equivalent to the improper vector 
function fj( a) defined by means of 


E(I)<p 


II £(/)<? II 2 ’ 


<P(0C):= 


E(I)cp 

\\E(IM 


(here ~ means that the expression on the left is defined by the expression to 
the right of the equal sign) is an improper vector function, and satisfies 

fj(a) = <p(a)<<p(a),/>. (3.11) 

We therefore obtain the expansion 

f = (3-i2) 

JspM) 
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The improper vector function <p(oc) itself need not be summable; but the 
improper function rj(a.) given by (3.11) must be. 

If Sp(,4) is discrete, then (3.12) becomes 

£ *»<*,>/> = /. 

v 

where i// v are the normed(!) eigenvectors of A: 


, = ,. (£(« v ) - £(«v ~ £))<?> 

^ . “ ||(£(a v ) - £(a v - e))<p|f 


(3.13) 


Let p(k ) be a positive c-additive measure over the set P of measurable subsets 
of Sp(d). It is not assumed that E{k)q> ^ 0 implies that p(k) / 0. In addition, 
p(k) need not necessarily be normalized; it is permissible that p(Sp(d)) = oo. 
However, for all finite intervals / for which £(/) # 0 we assume that p(l) # 0 
and p{I) is finite. Thus an improper function is defined by p(I), which we 
denote by dp(oc). 

For a partition of the a-line into intervals we obtain: 

£(/> / E(I,)<p \ 

v \\ e ( i,m \ii£(/>irV 

vM^fpU 


= £ 


£(/> 


I|£(/>UVp(£) Nil£(/ 


y/)p(U 


For the improper vector function 


<?(a) ••= 


E(I)(p 


II £(/)<? II y^T) 


we obtain, in the limit 


/= f iA(a)<iA(a), /> dp(a). 

JSp(A) 


(3.14) 


(3.15) 


It is customary to separate the discrete spectrum from the continuous spec¬ 
trum: Sp(y4) = k d KJ k c where k c = k cc u k sc (k c = Sp c (d), k d = Sp/,4) in 
IV, D 2.5.4). For F v = £(a v ) — £(<x v —) (a v are the points of the discrete 
spectrum) we define 

H(a) = £(«)- £P V (3.16) 

a v ^a 

and 

^4 = A d 4- /4 C , (3.17) 

where 


= £ a v p v 

V 


and A c = 



(3.18) 
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We define the improper vector function ijr c (c/.) in a similar manner as (3.14) 
substituting E(a) for E(o c). For i ji c defined according to (3.13) we obtain, 
instead of (3.15), 


/=£ ^<*Av,/> + f &(« )<£(«),/> rfp(a)- (3-19) 

v •'k c 

Both forms (3.15) and (3.19) are, in principle, possible; for “physical” pur¬ 
poses (3.19) is more useful, because in (3.15) the discrete part of the spectrum 
is difficult to determine. In order to make the formula more transparent, we 
set A d — 0, that is, k d = 0. 

For this case we shall discuss (3.15) further. 

We now choose p(k) = p(k) where p is defined by (2.4). Then we obtain 

(320) 


where/(a') is the ^-measurable function which corresponds to/. Thus it 
follows that, in the limit e -> 0, 


lim 


(E( a) — E(a — e))cp 
o\l!( £ ( a ) - £ ( a - £ ))9>H 2 


/> = m 


(3.21) 


holds modulo sets of measure zero. For p = p we may replace the improper 
vector function </(a),/> by the /^-measurable (proper) function/(a) given 
by (3.21). 

If we choose p{k) = l(k) where l is the Lebesgue measure (where we prefer 
to use da instead of dl(a)), we may not, in general, expect that in the limit 
e -* 0 </(a),/> will be equal to an /-measurable function/(a). This is the 
case only if the spectrum of A is absolutely continuous. If this is the case, 
then for a suitably chosen /-measurable function k(a) we obtain 


p(I) = j" k(a) da. 

Then we obtain almost everywhere 

" f< “ x 

where 


ll/ll 2 = j\f(u)\ 2 dp(a) = J|f(a)| 2 da. 


(3.22) 

(3.23) 


Equation (3.15) is always correct. In / measure we obtain 
</, 9"? = J</, iRa)></(«), 9) da 

= J" f(a)g(a) da, 


(3.24) 
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where the final equal sign = holds only if the spectrum of A is absolutely 
continuous. Then, instead of (3.22) we write 

<#(*),/> = f(«). (3-25) 

An improper vector function %(a) is said to belong to the domain of definition 
of an operator A if the vectors y(a, a — e) belong to the definition domain of 
A for all s which satisfy 0 < e < g 0 for some e 0 . The above <p(a), $(a) belong 
therefore to the domain of definition of A. If we substitute the vector Af for 
/ in (3.12) and observe that <<p(a), Af > = <A<p(a), />, we obtain 

A f=j $>(<*)< A(p(a),f). 

We will now show that 

<p(oc)<A<p(a),/> ~ <p(oc)a<<p(a), />: 

| <04 - al)£(/)<p,/>| = \{(A - al)£(7)<p, £(/)/> | 

< II (A - <xl)E(I)(p |! ||£(/)/||. 

According to (3.6) we obtain 

II (A - c/A)E(I)(p\\ < (a - c/’)\\E(I)(p\\. 

Let S be the maximum length of the intervals / v , then 

y £(/> / Ejlfcp \ E(If<p / E(I v )cp \ 2 

^||£(/>||\ ||£(/>||’V ^ ||£(/ v )^ll av \||£(/ v )^||’- / / 


= E 


V 


(■A - M) 


E(I v )cp A 2 


<5£\\E(I v )f\\ 2 = d\\f\\ 2 . 

V 


For <5 -» 0 we obtain 


A f=j <P( a ) a <<P(a),/> = JtKa)a<i£(oO,/> da. (3.26) 

We may therefore replace A(p(a) by a(p(a) and Af(a) by a$(a), respectively; 
here we use the notation A<p(a) ~ aq>(a), A{j/(u) ~ a\J/(a). We call Cp(a) or 
\}(a) the improper eigenvectors of A. 

From the continuity of the inner product and, from (3.15), it follows that, 
for p — l: 

W),/> = J<■£(«'), &«)><#(«),/> da. (3.27) 

Therefore <i£(a'), $(a)> is an improper function of the two(!) variables 
a', a. This function is called the Dirac delta function, which is often written 
<5(a' — a) instead of 5(a, a’). 

For the construction of the <p{ a) and f/(a) we have made use of our knowl¬ 
edge of the spectral family £(a). If the spectral family £( a) is not known, then 
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we may often succeed by proceeding as follows: From 

£(«)/= f <£(« 'Kfc«'),f>da (3.28) 

we seek (often heuristically) to guess /(a) as an improper eigenvector for A 
from the equation Aiij/(a) ~ ai^(ot) and then construct the operator E(a) from 
(3.28). If, for example, A = (1 /i)(d/dx) is an operator in the Hilbert space 
defined on quadratically integrable functions over x from - oo to oo then 
we seek solutions of the equation ( \/i)(d/dx)g(a , x) = ag(a, x). We obtain 
y(x, x) = ae mx . The g(a, x) are not vectors(!) in since |? x \g(a, x)\ 2 dx is 
not convergent. But the |“_ 8 e ix x da.' are vectors in and define an improper 
vector function y(x, x). With the correct normalization we can set 

/(a, x )••= -j= f e* x da. (3.29) 

yj 2,71 J a — s 

Thus, from (3.28) we obtain 

£(<x)/(x) = JL J e ia ' x ^ e' ia ' x ' f(x r ) dx'^j da'. (3.30) 

The representation of the vector / in terms of the improper function 
<$(a), /) (or, if it is permissible, in terms of thef(a)) is called the ^-represen¬ 
tation. It is not uniquely determined, because, instead of \j/(a) we could have 
chosen e igfx) \l/(a) as the improper vector function providing g(a) is chosen 
subject to the requirement that the appropriate integrals exist; instead of 
<$(<*),/> we get the improper function e~ ig<x) (4f(a),f ) to represent/. 

If B is another maximal self-adjoint operator and x(P) is the corresponding 
set of improper eigenvectors of B, then we may obtain an expansion of the 
l(P) with respect to the /(a) as follows: 

X(P) = J'/'(a)<a|/?> da, where <oc|yS> = <$(oc), *(/)>. (3.31) 

<a | /> is therefore an improper function of the two variables a and ji. Accord¬ 
ing to (3.31) and (3.15) (with l instead of p) (a\fi) are nothing other than the 
improper eigenvectors y(fi) of B in the ^-representation. In the A-represen- 
tation we obtain (where we use the same symbols for the operator B): 

B( a|/> ~/<a|i?>. (3.32) 

For </f | a) = <x|/i> = i J/(a)) it follows that 

<£(«) = fm<P\«>dp, (3.33) 

so that, for the operator A in the B-representation we obtain 

A(P\a) ~ a</|a>, (3.34) 

that is, the </?|oc> are the improper eigenfunctions of A in the B-representa¬ 
tion. 
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A bounded operator D can be characterized by the matrix 
<v|D|/r> = <<p v , Dcp^y 

with respect to a complete orthonormal basis cp v . The <v|D|^> can be 
considered to be the expansion coefficients of Dcp 

D<Pn = 5>v<v|D|M>- 

V 

We may obtain a corresponding matrix for D with respect to an ^-representa¬ 
tion as follows: 

£)$( a) = J*$(a'Xa' | D | a) da.', (3.35) 

where 

<a'|D|oc> = <$(a'), Z>t^(a)>. 

We may express the operator D in the ^-representation as follows: 

D<ta/> = J<a |D|a'X$(a'),/> da'. (3.36) 

According to (3.35), <a'|D|a) is an improper function of the two variables 
a', a. For the 1-operator we obtain the following special case: 

<a' 111 a) = <tKa'), <?(a)> 

= c>(a, a') = 5(a — a'). (3.37) 

4 Transformation of One Representation into Another 

At the end of the previous section we have considered two different repre¬ 
sentations of a Hilbert space, the A- and ^-representations. We shall now 
introduce an elegant notation, due to Dirac, which facilitates the transforma¬ 
tion from one representation to another, and simplifies the calculations, 
providing that it is clear what is meant, in mathematical terms, by the 
notation. 

The simplification arises from the fact that, for the case of the discrete 
spectrum as well as for the continuous spectrum of A, the same symbol | a) 
is used for the proper eigenvectors and improper eigenvectors of A, instead 
of i// v for the eigenvectors and ij/(a) for the improper eigenvectors of the 
continuous spectrum. The real number a in | a) denotes the corresponding 
value in Sp(A). If / is an arbitrary vector in Jf we then write | /) instead of 
/. With the new notation the expansion of / with respect to the proper and 
improper eigenvectors of A is written: 

|/> = X|a v ><a v !/>+ f|a><a|/> da, (4.1) 

V J 

where <a v |/> = (a |/> = <$(a),/>. The sum in (4.1) is taken over 

all discrete eigenvalues, that is, over the discrete spectrum of A. 
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To each vector / e Jf there exists a linear form 1(g) = </, g) on J*f; we 
denote this linear form by </1. The value of this linear form for a g e is 
</|c/> = </, gr>. All bounded linear forms over are of the form </1 for 
a suitably chosen / (see AIV, §4). The set of </1 is therefore identical to the 
Banach space which is dual to (see AIV, §4). 

For improper eigenvectors | a) we define an improper linear form <a | by 
<ot|gf> = <$(«), g > for g e 

On the basis of the results of §3 it is not necessary to provide additional 
explanation of the new notation. We shall only provide a few examples of 
the relationship between the new and old notations as follows: 


= l/X/l 

for / eJf satisfies ||/|| = 1; 

(4.2) 

£(«)= I 

KX“vl + 

| \a}(a\da; 

(4.3) 

a v ^a 




> 

Ki¬ 

ll 

a vX a vl + J 

a|a) <a| da. 

(4.4) 


In the A-representation the vectors / e are represented by “functions” 

<a|/>. We then find that 

</,0> = £ </l ot v>< l *vl0> + f</1 a ><<*!#> da. (4.5) 

V J 

Clearly </1 a) = <a | /). In the A-representation, from (3.26) we obtain 

A<«|/> = a<a|/>, (4.6) 

that is, A is the operator which multiplies the function <oc|/> by a. If D is 
an operator which admits a matrix representation (for example, if D is 
bounded) then in the A-representation we find that: 

0<a|/> = £<a|D|a v ><a v |/> 

V 

+ j’<a|D|a'><a'|/>da'. (4.7) 

If £ is a second maximal self-adjoint operator, then, by analogy with (4.1) 
we obtain 


For 


l/> = II^>OW> + 


Jl PXP\f>dp. 


\p> 

| a) 


£ KX<*vl/?> + |a><a|#> da, 

V J 

Ei0„x/g«>+ [\p><p\* >dp, 

n J 


(4.8) 


(4.9) 


Piifie. 7lltt£fccj<t giumf ?fcy 4iai 



24 IX Representation of Hilbert Spaces by Function Spaces 


it follows that </?|a> = <a|j 3) and 

</*!/> = E<j8|«vX«vl/> + f<y5|a)<a| f) da, (4.10) 

V J 

<a|/> = £<a|/U<iW> + U*\PXP\f>dp. (4.11) 

n j 

Equations (4.10) and (4.11) are the transformation equations between the 
A- and ^-representations and vice versa. 

The < /? | a) are the (proper and improper) eigenvectors of B in the ,4- 
representation 

501/?> = /?<* I £> (4-12) 

and </?|a> are the eigenvectors of A in the B-representation 

A<p\oc) = a(p\a). (4.13) 

Here we use the equal sign = instead of the equivalence symbol ~ (in the 
sense of §3 for improper functions). 

For the matrix of the operator D we obtain 

0|D|a'> = E^I^XiSJDI^X/Spla') 

P>P 

+ 1 U*\P>XP l .\D\P><P\<*>dP 

H J 

+ E (<a\P><P\D\p p }(p p \a'>dp 

P J 

+ jj<*\p><p\D\pxp'\tfydpdp. (4.14) 

The ensemble operator W e K (because it is bounded) corresponds to a 
matrix (a\W\a'} in the ^-representation. For a arbitrary complete ortho¬ 
normal basis rj v , noting that 

Z <«l»f v ><»lv|a'> = S(a,a'), 


the physically important quantity lr(WF) is given by 

tt(WF) = JJX <>? v |«><a| WF |a')<a'| r, y ) da doc ' 

= j"J 5(a, a')<a \WF\a'} da da’, 

where JJ means that we take the sum over discrete values and the integral 
over continuous eigenvalues a, a’. We therefore obtain 

tr (WF) = £<a v |IFF|a v > + JJ<5(a, at')<a| WT|a'> da da'. (4.15) 
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We will now use (4.15) to illustrate the computation of probabilities. 

Let £(x) denote the spectral family of the operator A, that is, in the ,4- 
representation we have 


£(«')<« I/> = 



if a < a', 
if a > a'. 


For a 2 < X; we shall denote £( 04 ) — £(oc 2 ) by £(/). Suppose that an en¬ 
semble W is prepared for which tr( £(/)) = 1, that is, for the ensemble W 
a measurement value a e / for the scale observable A is obtained with cer¬ 
tainty. Therefore we find that W = E(I)WE(I), and (4.15) is transformed 
into 

tr (WF) = Z <oc v |W£|a v > + f ‘ f '<5(a, a')<a| W£|a'> dot. da 1 . (4.16) 

V Ja 2 J 0C2 

a v e/ 

If £ is a maximal scale observable and F is the corresponding projection 
such that the measurement value of B falls in the interval J: P2 < ' < Pi, 
then it follows that 


<a" | £|a'> = Z <a"|j8„Xj8>'> 

Pn eJ 

+ f 1 <a"|j 8 ><jS|a')d)?. (4.17) 

Jp 2 


Using (4.17) we may calculate (4.16) in the general case. Since the latter 
formula is more encompassing, we will explicitly state only certain special 
cases which illustrate the general procedure. 

Let us consider the special case W in which the measurement value of A 
does not have any dispersion. Then W = P Vv for some <p v where q> v is the 
proper eigenvector for A with eigenvalue a v . Thus it follows that 
<a v | W|oc v > = 1, and the remaining matrix elements of W are equal to zero. 
Therefore it follows that 

tr(P„ v £) = <a v |WF|a v > 

= <a v | W|a v ><a v |£|a v > = <a v |£|a v > 

= Z <avl^X^I«v>+ fVl^><j 8 |«v>dA 

ti 4 / s 2 

rPx (4.18) 

tr(P^F) = Z l< a vl^>l 2 + K«vlyS)l 2 dp. 

II J02 

K a vl^)l 2 is therefore the probability for the discrete value /?„ and 
| <a v | y 8 > | 2 is the “probability density” for the continuous values of the spec¬ 
trum of B. 
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Suppose that the interval I for which E(I)WE(I ) = W is such that I does 
not contain any discrete eigenvalues of A. Then from (4.17) and (4.18) it 
follows that 

tr (WF)= £ P'da f'daX^laXal^la'Xa'I^) 

fj. *a.2 

Pn e J 

+ {"'dp [dot [ W</?|a><a|IF|a'><a'|j8>. (4.19) 

* @2 *&2 

Of particular importance for the evaluation of (4.19) are the “kernels” 

KJa', a) = <7.'|/i„></^| a>, ^ 

Kp( a', a) = <a'|)S>< j 8|a>. 

For the measurement values of the discrete spectrum we obtain the 
probability 

= [ dot [ da.'K Jot.', ot)<ot| W\cx!'y (4.21) 

J0.2 *0-2 

and for the continuous spectrum we obtain the probability density 

w(p) = [ ‘dot r‘da'^(a',a)<a|lF|a'>. (4.22) 


In general little can be said about (4.21) and (4.22). If the interval 
/: a 2 <•••< otj is small, then there can exist values of p^ and P for which 
KJa', a) and KJa', a) are, for all practical purposes, constant for a 1 , a el, 
that is, for an ot 0 in I we obtain: 

KJa', a) « |<j3Ja 0 >l 2 , KJa’, a) * \ <P\<*o>\ 2 - (4-23) 

For such p^ or p (4.21), (4.22) are transformed into 

= |<)SJao>l 2 c w , (4.24) 

HP)=\<P\« o)l 2 c w (4.25) 

with constant 

c w = Pda r‘da'<a|IF|a'>. (4.26) 

Jet 2 *<*2 


Equations (4.24), (4.25) then show the dependence of the probabilities , 
or probability density w(P), upon the values of the spectrum of B. Equations 
(4.24), (4.25) hold only for those p „ and p for which (4.23) is a good approxima¬ 
tion for a', a-e I. This cannot be the case for all p u , p because the following 
equation must be satisfied: 

Y [ da [ da.'K Jet', a)<a| JF|oc'> 

H Ja.2 

/*oo /»ai pa. i 

+ dp da da'KJa', a)<a| IF|a') = 1 

j ~ 00 J&2 J&2 
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and (4.23) would lead to the result 


£l<0>o>l 2 + J Kj8|«o)l 2 d/f| = 1 




in contradiction to the “normalization” equation 


£<a|/*M></g«'>+ f <<x\PXPW}dp = d(a,a') 

H J — oo 

(where a and a' lie in the continuous spectrum of A) from which it follows 
that 

IK/*»I 2 + f K^l a >l 2 dp = oo. 

fi J — oo 

Nevertheless, the physical interpretation of </?|a> is most intuitively evident 
in (4.18), (4.24), (4.25) providing that (4.24) and (4.25) are used with suitable 
caution. 


5 Position and Momentum Representation 


This section will serve to illustrate the discussion in the previous section by 
means of special examples. The standard example, the so-called “harmonic 
oscillator,” is particularly suited to illustrate those aspects of the structure 
of quantum mechanics which will be considered in X, and is of considerable 
importance also for “realistic” problems. As an approximation, the harmonic 
oscillator plays a large role for molecular spectra (see XV) and provides the 
basis for quantum field theory (Fock representation, see, for example, [4]). 

The position and momentum representation was originally obtained 
from the representation of the Galileo transformation, as we have outlined 
in VII, §2. The spaces jS? 2 (R 3 , dx x dx 2 dx 3 ) and S£ 2 (R 3 , dki dk 2 dk 3 ) describe 
the position and momentum representations, and are isomorphically 
connected by the Fourier transformation. Here we shall treat the same 
problem from the other side in order to illustrate the methods of previous 
sections. Here we shall consider the “one-dimensional” case of a position 
operator Q and a momentum operator P which satisfies the commutation 
relation (see VII (4.22)) 

PQ-QP = \ 1. (5.1) 


An “harmonic oscillator” is defined by the Hamiltonian operator 


»r 1 war 

"-2^ + “«' 


(5.2) 


which corresponds to an elementary (one-dimensional) system of mass m 
in an external field of the (one-dimensional) potential {mcj 2 /2)Q 2 (for external 
fields see VIII, §6). 
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We now introduce new operators 
Q' = y/mcoQ, 


We obtain 


F = 


1 



P. 


P'Q' - Q'P' = T 1 

i 


(5.3) 


(5.4) 


and 


H = coH', where H' = &P' 2 + g' 2 ). (5.5) 

It suffices, therefore, to find the solution of the problem for Q', P', H', and 
the solution for Q, P, H is obtained by simple computation. In order to 
simplify the notation, we shall omit the ' in Q', P' and H'. 

We shall now make the following assumptions: 

(1) The linear manifold if = P)™=i.m=i 0^p» n @Q") (where 3> is the 
domain of definition of the operators) is dense in 

(2) PQ-QP=-i 1 in if. 

(3) H = j,(P 2 + Q 2 ) is a self-adjoint operator with spectral decomposi¬ 
tion E(e) and E(e)f e 5£ for all / e 

The assumptions (1), (2), (3) are satisfied for the position and momentum 
operators introduced in VII, §4. Conversely, if (l)-(3) (where (3) can be 
weakened somewhat) are satisfied for the position and momentum operators, 
then the infinitesimal transformations corresponding to P and Q correspond 
to a representation of the Galileo group. We refer readers who are interested 
in these relationships to [5]. 

Assumption (1) permits us to construct arbitrary products of the form 
pziQhpziQih Therefore we find that ?£ c S/j h . 

We define the operators: 


A = ~ (Q + IP) and 

x/2 


B = ~(Q~iP). 

72 


(5.6) 


In if we therefore find that B — A + , and that the following relationships are 
satisfied: 

AA + - A + A = 1 and H = A + A + |l. (5.7) 


Therefore, in if instead of H we may consider the operator N = A + A. From 
assumption (3) we obtain: 

N = P A d£(2), (5.8) 

j — 00 


where E(X) is the spectral family for N ; from the latter we may easily obtain 
the spectral family for H. 
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The spectrum of N cannot be negative, since 
</, Nf) = 114/1 2 > 0 

for all / in if. 

According to Th. 1.3 (even in the case in which N is not maximal) there 
exists a tp such that 

(£(a) - E(P))(p # 0 for E(a) - E(fi) # 0. 

If A is a value from the spectrum of N, then for the vectors 

a 1 = (Sw - E(A - e))(p 
x ’ SJ II (£(A) - E(X - e)M 

we obtain 


Nx(A, s ) - A/(A, s ) -> 0. 

Conversely, if we are given a set of vectors /(e) for which ||/J > S and 
N-/,; — A'Xe -*■ 0 then X is a point of the spectrum of N; on the other hand, if 
A’ is not a point of the spectrum, then there exists an interval A 1 — r\ ■ ■ ■ X + q 
for which E(A + q) — E(A — rj) = 0, and for every vector / we would find 
that || (N — AM)/1| > t]\\f\\. From (N — AM)/(e) -+ 0 for / = /(e) it would 
follow that x(s) -> 0 in contradiction to the assumption that ||/(e)|| > S. 

From (5.7) it follows that in JX the following relationships are satisfied: 
AN - NA — A and A + N — NA + = —A + . Since the /(A, e) lie in if we 
find that 


and 


NAx(A, e) = ANx(A, e) - M/(A, e) 


NA X (A, s ) - (A - 1 M/(A, e) = A(N - A1)/(A, e). 

For (N - A1)/(A, e) = h we obtain 

\\Ah\\ 2 = (h, Nh > = <h, (N 2 - AN) X (A, e)> 

< \\h\\ ||(iV 2 — AN)x(A, e)|| < \\h\\Ae 

(the last relation is obtained in a manner similar to (3.6)). Therefore we 
find that Ah -> 0, that is, 

(N - (A - 1)1 )A X (A, e) 0. (5.9) 

In a similar way it is possible to show that 

(IV —(A + l)lM + *(A,e)->0. (5.10) 

Since || A + x(A, s)\\ 2 = </(A, e), (N + 1)/(A, e)> = 1 + \\A X (A, e)|| 2 it 
therefore follows that if A is a value of the spectrum of N then so is A + 1. 

If ||M/(A, e)|| >5 for all e, then A — 1 will also be a value of the spectrum 
of N. Since the spectrum of N is nonnegative it follows that there must be an 
integer n for which there exists a sequence s v -> 0 such that A n+ */(A, e v ) -+ 0 
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and \\A"x(A, e v )|| > <5. In addition (N — (A — n)l)A"x(A, e v ) -> 0 must be 
satisfied, that is, A — n must belong to the spectrum of N; in addition 
(AT — (A — n — 1)1)j4" +1 x(A, e u ) -> 0 and therefore NA n+1 x(A, e y ) —<• 0. 
From NA n x(A, £ v ) = A + A n+1 x(A, s v ) and \\A + A n+1 x(A, e v )|| 2 = 
\\A n+1 x(A, e v )|| 2 + (A n+1 x(A, e v ), NA n+1 x(A, s v )) it follows that 
A + A n+1 x(A, € v ) -> 0, that is, NA n x(A, e v ) -> 0. From (N - (A - n)l)A"x(A, e v ) 
—► 0 it follows that (A — n)A”x(A, e v ) -> 0. Since \\A”x(A, e v )|| > <5 we must 
have A = n, that is, A is a positive integer n > 0. Therefore the spectrum N 
is discrete, and can only contain integers n > 0. 

Since NA n x(A, e v ) -> 0 and || x(A, e v )|| > S the value 0 is a value in the 
spectrum of N. Therefore there exists an eigenvector </> 0 of N with eigenvalue 
n = 0. We may assume that ||$ 0 || = 1. From N</> 0 = 0 and from ||1V</> 0 || 2 = 
|| j 4 + AI </> 0 || 2 = \\A(j) o || 2 + ||^ 2 0 o || 2 it follows that 

A4> o = 0. (5.11) 

Since Ncj) 0 = 0 <j> 0 , it follows that, for x(A, s) = (f) 0 in (5.10), by recursion 
N(A + ) n (j) 0 = n(A + ) n (j) 0 . The spectrum of N consists of all integers n > 0 
and (A + ) n <f) 0 is an eigenvector of N with eigenvalue n. 

We set 

4>„ = a„04 + )"4> o, (5.12) 


where a n is the normalization factor. From the normalization requirement 
\\(j) n \\ = 1 it follows that 


1 = ll^ll 2 = 


a n- 1 




<<t>n-l,AA + 


+ l)^> n -i> = n 


a„-i 


Therefore the </>„ will be normalized if we set a„ = 1/ v 4jT; we obtain 

<Pn = -T=t(^ + )"^0> 


(5.13) 


and we obtain 


N(f>„ = n(j)„. (5.14) 

Since the (j) n are eigenvectors which correspond to different eigenvalues, they 
are pairwise orthogonal. 

If the eigenspace of N for the eigenvalue 0 is multi-dimensional, then we 
may introduce a complete orthonormal basis (f> ( Q in this eigenspace. We then 
define 


= 



(A + r^\ 


(5.15) 
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The (f> (k) are normalized and are pairwise orthogonal, that is, they are an 
orthonormal set in jf. We will now show that they are complete. If this is 
not the case, then there exists a smallest ri > 0 for which the do not span 
the entire eigenspace of N with eigenvalue ri, that is, there exists an eigen¬ 
vector i >]/„, (which is orthogonal to all the <j> ik )) of N with eigenvalue ri, which, 
according to (3), lies in if. From ri # 0 it follows that A\jj„. # 0 and 
= (n' — \)(Aij/„.). Since A\j/„' is also orthogonal to all 1; this result 
contradicts the condition that ri is the smallest eigenvalue for which the 
do not span the entire eigenspace. 

Let :rif K) denote the subspace spanned by <j) (k) for fixed k; clearly 

(5.16) 

k 

The operators A, A + , H, P, Q all act in the various in the same way; 
thus M' can be expressed as a direct product as follows: 

Jf = x (5.17) 

with the correspondence 

<l> ( P~<t> H tS k \ (5.18) 

where the (f>„ form a complete orthonormal basis for and the u (k) form a 
complete orthonormal basis for 3fC 2 , and the operators A, A + , H, P, Q all 
have the form 


A x l, A + x 1, etc. 


(5.19) 


and the A, A + ..., as operators in ^ obey (5.11), (5.13) and (5.14). It there¬ 
fore suffices to consider the above operators in where N is maximal 
(according to Th. 1.6 we need only choose <p = «„</>„ where ot„ # 0 and 

|a„| 2 < oo). For there exists an N-representation given by f «-><«!/>• 
From 




Q = —t=(a + a + ) 

V2 


(5.20) 


it follows that 


P^n = j4(V”^»-i “ V n + 

Q4>n = -J=(.y/n<j>n-i + sjn + 


(5.21) 
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For / = Y,n ( l> n ( n I /> it therefore follows that, in the /V-representation that 
iV<n|/> = n<n|/>, 

P<n\f) = <n|P/> =^(v / «<n - l|/> - Jn + l<n + l|/», 

e<n|/> = <n|Gf> = ^(V^(»- 1 !/>+x/^+ 1 ' <«+l|/»> 

1 1 


<n|P|m> = - \/ m + 1S n,m+ l)> 

<n|<2|m> = + v / rnTT<5„ >m+1 ). 


(5.22) 


In order to find the (^-representation of it is necessary to solve the eigen¬ 
value equation for Q: 


Q(n\x > = x(n\x). 

Using (5.22), (5.23) is transformed into 


(5.23) 


h + l 


<n 4- 11x) + /-<n — l|x) = x<n|x) 


if we substitute 


<«|x> = 



(Q\x)H„(x) 


(5.24) 


from (5.23) we obtain the recursion formula 

H n = 2xH n _ 1 — 2(n — l)//„_ 2 (5.25) 

with the initial value H 0 — 1. The functions defined by this recursion 
formula are called the Hermite polynomials. They are identical to the expan¬ 
sion coefficients of the “generating function” 

f(x, t) = e-' 2 + 2tx = e*V< ! -*> 2 = £ H n (x) - (5.26) 

n = 0 n - 

from which it follows that 


H n (x) = 


d 2 

dt 2 


/CM) | t=0 


(-l)V 2 


d n e~ x2 

dx" 


(5.27) 


Since 


dx 


f(x, t ) = 2 tfix, t ) 
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from (5.26) we obtain the relation 


and, from 


H' n (x ) = 2nH„_ 1 (x) 


— fix, t) + 2 (t - x)f(x, t) = 0 


(5.28) 


we obtain the relation (5.25). Since from (5.26) it follows that H 0 (x) = 1, we 
have proven the equivalence of (5.25) and (5.26). 

The <n|x> in (5.24) are improper eigenvectors since Yf= 0 \(n\x) | 2 = oo. 
We note, however, that 


<n|<p> 


fX 

Jx — t 


<n|x'> dx' 


are approximate eigenvectors of Q since (as we will shortly find) 
0 \{n\(p}\ 2 < oo. We will now determine <n|0> from the normalization 
condition 


Y (x'\n}(n\x} = S(x — x'). 

n = 0 

For this purpose, using (5.27), we obtain 

9U, x') = Z 2 ^, H n (x)H n (x’) = e*’ 2 Y H„(x)^£,Je 
From the Fourier expansion 


(5.29) 


-*■> = _L f 

2 y/n J- 


we obtain 


e -i<ox’ e -0,2/4 da) 





e~ icox (ia) n e'~‘ o2/4 da 


and, using the generating function in (5.26), it follows that 


1 f°° 1 

g(x, xf = e* 1 -- 7 - I e-“-e-“ 2 /4 X rj-r H n (x)(iaT da 

2^J 71 J — oo n ^ M • 

= e x ' 2 —^ 7 = f e -i‘ox' e -<o 2 /4 e -Ua>l2)* + 2(ia,l2)x fa 

2y/n J- 00 

= ■ s fne x ' 2 — f da = ^/ne* 2 d(x — x'), 

2 TtJ-oo 

where we have used the following identity from the Fourier integral theorem 


Jf g i«(x *') da) _ ^ _ x ') 
2^ J-oo 
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Therefore, instead of (5.24) we can write, 
(n\x) = 1 


H„(x)e~ 


Each vector / e can therefore be represented in the (/representation by 
a function <x | / > as follows: 

<x|/> = E <x|n><n|/>. 

n 

In particular, the operator P can be expressed in the g-representation as 
follows: 

E<x|/> = E <x|n><n|P|m><m|/>. 

n, m 

From (5.28) and (5.25) it follows that: 


E <x|n><n|P|m> = <x|m> 

H I ClX 


and we therefore obtain 


P<x\f>=\-f<x\f>. 

i ax 


The g-representation can therefore be chosen such that (5.31). In the g- 
representation we must, of course, have Q(x | /) = x<x | /), as can easily be 
shown. 

In the g-representation the Hamiltonian operator H is given by 

s ) 2 + * 2 } 

In particular, we therefore obtain 


H<x\n) = ^ (jJxj + x 2 <x\n) = (n + ^j(x\n). (5.32) 

If, instead, we begin with the g-representation which was given in VII, §4, 
then we could have used (5.32) in order to calculate <x | n) and then transform 
to the iV-representation. We have taken the opposite route in order to 
demonstrate the methods of §§1-4. 

We will now briefly outline how we may make the transition from the g- 
to the P-representation. In order to determine the transformation coefficients 
<x|p> we proceed from the fact that in the g-representation the <x jp) 
must be (improper) eigenvectors of P: 

T-^-<x|p> = p<x|p> (5.33) 

i ax 

from which it follows that 

<*|p>= * **. (5.34) 

y/2n 
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The factor already satisfies the normalization condition 
f <P'\x}<x\p) = d(p - p’). 

J — CO 

The relation (5.34) may be formulated precisely in the sense of §4. <x| p} is 
an improper function of two variables x and p : 

rP r-x 

<x|p>: dp' dx\x'\p). 

Jp — e Jx—d 

The reader may visualize the concepts of §4 by means of this example. 

We shall now use the <x |n> and <x | p) in order to explicitly compute the 
probabilities which were discussed at the end of §4. 

Given an ensemble W for which the energy has zero dispersion, that is, 
W = P 4>n for some n, then the probability of a position measurement is, 
according to (4.18) given by the probability density 

w(x) = |<x|n>| 2 . 


In Figure 1 <x|n> is displayed; the behavior of |<x|n>| 2 as a function of x 
can readily be ascertained. 

Conversely, given an ensemble W for which the position is in the interval 
x — e, x with certainty, it follows that the probability that a measurement of 
the energy will be (n + |) is, according to (4.21), given by 


w 


n 



dx"K n (x", x')<x' | W |x">, 


(5.35) 


where 


K n (x', x") = <x' | n) <n | x"). 
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From Figure 1 it is clear that, for small e, the function K„(x', x") is, for all 
practical purposes, constant in the interval x — e • ■ ■ x providing that n is 
not too large! If n is not too large, then, according to (4.24) we obtain 


w„ « |<x|n>| 2 c w , where c w 



dx\x'\W\x"). 


(5.36) 


From this example it is clear why (5.36) cannot be a good approximation 
to (5.35) for all n. 

The above derivation makes it possible to determine those ensembles P (f> 
for which equality holds in the Heisenberg uncertainty relation IV (8.3.18). 
According to the discussion immediately following IV (8.3.17) equality can 
hold if there exists a real number a for which the relation 


holds, where 


(6' + iaP') = 0 


(5.37) 


P' = P - tr(P„P)l = P - pi, 

Q' = Q - tr(.P„e)l = Q - ql. 

Q' + iaP' is, however, except for a factor and an origin displacement, 
identical with the operator A or B = A + in (5.6). Therefore (5.37) can only 
be satisfied for a > 0 and must correspond to the “ground state” of the 
harmonic oscillator. We may express (5.37) in the ^-representation as 
follows: 


d 1 

x — q + a --iap <x | <p> = 0. 

ax 


Thus it follows that 


(x\cpy = 


_ e ~[(x-q) 2 /2ix]- ip(x-q) 

x/2\/n 


(5.38) 


which is also normalized, is a Hilbert space vector only if a > 0. For the 
position probability density in the ensemble W = P^ we therefore obtain 


|<x|<p>| 2 =— 1 
la-Jn 

This form is known as the “Gauss distribution (Gaussian).” 


(5.39) 


6 Degenerate Spectra 

The T-representation of a Hilbert space is possible if A is maximal, that is, 
if the complete Boolean ring E generated by the spectral representation of 
A is maximal (D 1.1). E is maximal if there is no decision effect E $ E which 
is commensurable with all elements of E. If a v is an eigenvalue of A and P v 
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is the projection operator onto the eigenspace for the eigenvalue <x v then each 
P v e 2 and each E < P v are commensurable with all elements of 2. If A is 
maximal, then from E < P v it follows that either E = P v or £ = 0, that is, 
P v must be one-dimensional. Therefore, to a v there exists a uniquely deter¬ 
mined eigenvector (up to a factor) cp v . In this case we say that the eigenvalue 
is nondegenerate. If Sp(/1) has only discrete values and is nondegenerate 
then 2 is maximal (see §5) and P (f)v e 2 for the complete orthonormal basis 
of the eigenvectors of A. If E # Ois commensurable with all P Vv (and therefore 
commutes with all P qv ) then it follows that EP Vv = P qv E < P qv , that is, 
Ecp v = (p v or Eq> v = 0 and E = £' v P qv where 2' is the sum over all v for 
which Ecp v = <p v . Thus it follows that E e 2. 

D 6.1. The dimension of a projection P v onto an eigenspace of A corre¬ 
sponding to the eigenvalue a v is called the degree (order) of the degeneracy 
for a v . Note that the statements “a v is nondegenerate” and “a v is degenerate 
of order 1” have the same meaning. 

We shall now outline how we may obtain an ^-representation for a Hilbert 
space for an A which is not maximal, and how the notion of degree of de¬ 
generacy may be extended to the continuous spectrum. 

As we have already mentioned at the beginning of §1, if we are initially 
given a nonmaximal A, then there exists a finite set of commensurable decision 
observables A, B, ..., the combination of which is maximal (D 1.1). The dis¬ 
cussion of previous sections is applicable to this case. If, for example, A and 
B are commensurable, then instead of an ^-representation we will obtain an 
(A, .^-representation by means of proper or improper functions in the 
(a, /)-plane as follows: / <-> (a, /]/>. Since the extension of the preceding 
formulas is trivial, we shall not discuss them here. In the following we shall 
present the case of position representation for several Q v . 

In order to define the notion of the degree of degeneracy more generally 
we shall begin with Th. 1.9. Suppose that E„(n = 1, 2,...) is a denumerable 
set of pairwise orthogonal decision effects which are commensurable with A ; 
so that the complete Boolean ring generated by the spectral family of A and 
by the E„ is maximal and that from EE„ = 0 for an E in the Boolean ring 
generated by the spectral family it follows that EE m = 0 for all m > n. For 
the operator B = £„ =0 nE„ and E 0 = 1 - £„ =x E n we find that A and B are 
a maximal pair (D 1.1) of scale observables, that is, the complete Boolean 
ring 2 generated by the spectral family of A and by the E„(n — 0, 1, 2,...) 
is maximal. According to Th. 1.3 there exists a (p for which 5/2, <p) = Jf. 
Therefore the discussions in §2 can be directly applied to A, B. 

For an interval L in the (a, //plane of the form L — (a l5 a 2 ] x (//, /i 2 ] 
we define the measure n as follows: 

P(L) = \\E^ A \l)E^{J)cp\\\ (6.1) 

where 1 — (oq, a 2 ], J = (/ l5 / 2 ] and E (A) (X), E (B \X) are the spectral families 
of A and B, respectively. Jf can therefore be represented in terms of i? 2 (R 2 , n). 
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Because of the special character of B we may simplify the representation 
using functions/(a, n) for n = 1,2,... for which 

X f | /(a, n) I 2 dn„( a) < co, (6.2) 

n = 0 J 

where 

find) = \\E (A \l)E n (p\\ 2 . (6.3) 

The inner product is given by 

if, f) = Z /( a > n M<x, n) dn„( ot). (6.4) 


We denote the space of quadratically integrable functions /(a, n) (in the 
sense of (6.2)) by if 2 (R x N + , n). Let Sp„(d) denote the set {a| for all a. u 
oc 2 where < a < oe 2 , it follows ||(£ u> (a 2 ) - £ M) (a ,))£„<? II 2 ^ 0}. 

We shall now show that Sp„(d) id Sp m (d) for m > n: If a £ Sp„(d) then 
there exists an interval a x < a < a 2 such that (£ (/1) (a 2 ) — E fA) (tx l j)E„(p = 0 
and therefore we obtain (£ </l) (a 2 ) — E IA) (a i ))E„ = 0. Thus, from Th. 1.9 it 
also follows that (E (A \a 2 ) - £ (X) (a!))£ m = 0 for m > n from which it 
follows that Sp„(/1) => Sp m (d). 

We shall say that n is the degree of degeneracy of the value a in the con¬ 
tinuous spectrum of A if n is the largest number for which a e Sp n (4). Here 
we permit the value n = co. 

It is easy to generalize the special case of operators Q and P which satisfy 
the commutation relations (5.1) onto a set of operators Q h P ; where i = 1, 
2,...,/which satisfy the commutation relations: 


PtQk 


Q k Pi = ~ <5*1, 
1 


QiQ k -Q k Q, = o, p t p u -p u p t = o. 


We introduce the operators 


Ai = —j= (Qi + iPi), 

72 


A? =-,= (Qi 

72 


iPd 


and the operator N = Yl= 1 A*A t . In the same manner as in §5 it is easy to 
show that N has eigenvalues n = 0, 1,2,... and there exists a vector 4>o f° r 
which Ai4> 0 = 0 for all A t . For the complete orthonormal basis (j) { Q for the 
eigenspace of N with eigenvalue 0 the set of vectors 


<t>nL 


"/ 


1 

777 




forms a complete orthonormal basis in Jf. 
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We may, as we have done in §5, express the Hilbert space in the form 
^ x where the Q h P t are expressed in the form Q t x 1, P t x 1; 
then, in the position representation J^ b takes on the following form 

(peJf b : (p*->(x u x 2 ,...,x f \(p), 

Qj(p*-+ Xj(x u x 2 , ■ ■ ■ , x f | (p}, 

„ 1 S . 

P j(p^^-fo< x l,X 2 ,...,X f \(p}. 

Jtf, is often called the orbit space (see VII and VIII). 
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CHAPTER X 


Equations of Motion 


In VIII we have established that VIII (6.26) and (6.27) are, in a formal sense, 
the most general form for the description of the time translation of a 
registration apparatus relative to a preparation apparatus. In VIII, §6 
we indicated that we would like to call VIII (6.2.7) the “equations of motion 
for quantum mechanics.” In this chapter we shall only be concerned with 
formal transformations of the above time translations. For that reason we 
shall take VIII (6.27) and (6.16) as our starting point. It will then be possible 
to repeat all the formal transformations for VIII (6.26) instead of VIII (6.27). 


1 The Heisenberg Picture 

We shall call the description of time translation based upon VIII (6.27), 
that is. 


f = + <U) 

the so-called Heisenberg picture. In the computation of statistical quantities 
and expectation values of the form lv(WB t ), W is taken to be constant. 

For the special case in which B t does not “explicitly” depend on t, that is, 
dBJdt — 0, it follows that 


B t =U t B 0 U?, (1.2) 

40 
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where, according to VIII (6.16), the unitary operator U satisfies the dif¬ 
ferential equation 

<>' 3 ) 

where U 0 = 1. The solution of the differential equation (1.1) for the case 
dBJdt = 0 follows, therefore, from the solution of (1.3) given by (1.2). For 
the special case in which H is independent of time, a formal solution of 
(1.3) is given by 

U t = e m . (1.4) 

Let A and H be a complete set of commensurable scale observables; let ot 
and e denote the values of the spectrum of the self-adjoint operators A and H, 
respectively. Then U, is the operator which multiplies the vectors (s, a|t//> 
(the vectors i // in the //^-representation) by e ia . If we represent B 0 by a 
matrix, then we may express B t by the following matrix: 

<£, a\B t \E', a') = e ,(E “ E)t <8, a|J3 0 |fi', a'). (1.5) 

The fact that B t is obtained from B 0 by means of a unitary transformation 
U t implies that B, has the same spectrum as B 0 because the spectral repre¬ 
sentation £j(2) of B, is obtained from the £ 0 (2) of B 0 as follows: 

E t (X) = U,E 0 (X)U;. (1.6) 

The solution of the eigenvalue equation B t cp t = ficp t follows simply from 

B 0 <p 0 = fhp 0 and is given by <p t = U t cp 0 . The physical meaning of this situa¬ 
tion is as follows: The possible measurement values for a scale observable 
cannot change with time if there is no time variable external field. 

From (1.6) it follows that the statistics of the measurement values do 
change with time. The statistics of the measurement values will remain 
constant with time for every ensemble only if B 0 commutes with all H t 
for all t —then we will obtain B t = B 0 and £,(2) = £ 0 (2). In the case in which 
B, = B 0 we say that B = B 0 = B t is a constant or integral of motion. 

For our first example of the application of the Heisenberg picture we shall 
consider the case of a one-dimensional harmonic oscillator under the 
influence of an external field. This example is more than an academic one; 
it plays an important role in many realistic physical problems—for example, 
as an approximation in molecular physics (see XV, §8) and as a problem in 
quantum field theory. 

For the external field we shall choose the simple potential field given by 
cj>(x, t) = —xk(t). Then the Hamiltonian operator H is given by 

# = 2 m P2( ° + “T Q2(t) ~ Qm (L7) 

where P(t) and Q(t) are the momentum and position operators, respectively. 
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In addition to scale observables P(t), Q(t), we shall also discuss the time 
behavior of the “energy” 

1 .. nr co „ -5 . 

H°(0 = — P 2 (t) + — Q 2 (t). 


For each value of t the operators P(t), Q(t) satisfy the Heisenberg commuta¬ 
tion relation (see IX (5.1)) 

P(t)Q(t) - Q(t)P(t) = \ l. (1.8) 


Since much of the mathematics development for this problem is completely 
analogous to that presented in IX, §5 we shall only present an outline. 

We introduce the operator A(t ) and its adjoint A + (t) as follows: 


m = 7 

From which it follows that: 



ma)Q(t ) + 
m(oQ{t) — 



'mco 

i 


P(t) 

P(t) 


'mco 




(1.9) 


A(t)A + (t) - A + (t)A(t) = 1, 


H 0 (t) = coA + (t)A(t) + 11, 

1 (1-10) 

H, = H 0 (t) - —= ( A(t ) + A + (t))k(t). 

y/2m(o 

For each value of t there exists a (normalized) vector ^ 0 (t)eJf which 
satisfies Xi(t) x F 0 (t) = 0. The vectors 

^(0 = ~A + (t)^ 0 (t) (1.11) 

V n - 

form a complete orthonormal basis in Jf. They are also eigenvectors of 
H 0 (t): 

= (n + |)coT„(f). (1.12) 

According to the Heisenberg picture we obtain the time evolution of the 
observables of interest P(t), Q(t), H 0 (t ) if we obtain A(t) from the differential 
equation (1.1): 

-A(t) = H t A{t) - A{t)H t = -coA(t) + 1. 

1 y/lmco 

The solution is very simple: 

A(t) = c-“%4(0) + (1.13) 
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where 


m = -jL= fk(r)e™ dx. (1.14) 

yjlmco Jo 

We shall write 'F„(0) = <j)„, so that from (1.10) we therefore obtain: 

</> n = 4=^ + (°)"</> 0 . (1.15) 

v n - 

According to the general considerations about the eigenvectors and eigen¬ 
values of B t we may obtain the eigenvectors of H 0 (t) from those of H 0 ( 0) 
by means of the operator U ,. Equation (1.13) gives us the value of the right- 
hand side of the following equation: 

A(t) = U t A(0)Uy. 

Then from (1.11) we obtain 

^(0 = u t A + (oyu; 'To(0. (1.16) 

The eigenvector T 0 (f) of H 0 (t) corresponding to the eigenvalue co/2 must 
have the form 


Voit) = aW t (/> 0 , (U7) 

where we may choose a(t) = 1. For computational purposes, however, it is 
desirable to refrain from this choice. From (1.16), (1.17) it therefore follows 
that, in general, 

«?„(!) = a(t)U t cf) n (1.18) 

U, is therefore determined if T„(t) is expressed in terms of the orthonormal 

basis of the </>„ and if a(t) is given. 

The expansion 

^(0 = E ^(0) (1-19) 

m 

is determined when we have computed <</> m , T„>. From (1.11) it follows that 

<4> m , ^(0) = - <Am m , 'Fo(O). (1.20) 

V n ‘ 

According to (1.13) we obtain 

.4(f)" = e - inmt £ | j/M(0)"~ v . (1.21) 

Furthermore, from (1.15) and the first equation in (1.10) it follows that 

^(O) p 0 m = y/m(m - l)-..(m - p + 1) (f) m _„. (1.22) 
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In order to find NP„) it therefore suffices to compute <0 p , 'F 0 >: 

< 0 ., 'ivo) = -4 <A + (on 0 , 'Po(O) = -U (^o, ^(or^oCO) 


= - 7 = <^o. C^(Oe iM ' - /(03^o(0> 

Vp'- 

= 4= [-/(O] p <0o, 'Po(0>- 

VP! 

Since |PF o (0ll 2 = £ P I <0p> ^oCO) | 2 = 1, we must have | <00.^0(0>l 2 
= e ~ l/|2 . Since we have not yet chosen the factor oc(t) in (1.17) we may choose 
<0 O , *P 0 (t)> = e~ (1/2),fl2 . Then we obtain 

<0„, »F 0 (t)> = 4r [-/(0] p c- (1/2)l/(,)|2 . (1-23) 

VP 1 

Then, from (1.20) to (1.23) we obtain: 

yyt f 

<0 m , ^(0) = 4 gi»«Wg-(l/2)|/(0| J [ _ /(f)] m-» i 

\ / W 


where <r = 0 for m > n and a = n — miox m <n. 
The pth derivative of the Laguerre polynomial 


(m — n + v)! ’ 


is given by 


L&x) = I 


/ v=o \v/ (v — p)! 
Therefore, for m < n we obtain 


(-xy-'X-tr 


yyi I 1 _ 

<0 m ,'P„(O>= / n 1 -re i "“ I e- <1/2) ' /w ' 2 (/(0"- m Lr m (l/(0| 2 ) (1.25a) 

■y n! n! 

for m > n we obtain 

<0 m ,'P„(O> = Hi Te - e - <1/2)| / W P(-/( 0r - nLrn( |/( t )| 2 ). (1 .25b) 

V m! m! 


Thus the expansion (1.19) and the operator U, defined by (1.18) is determined 
up to the factor a(t). 

With the aid of the <0 m , T B (f)) given in (1.25a) and (1.25b) we may now 
answer the following physical question: Suppose we are given a preparation 
apparatus which prepares an ensemble W for which the measurement of 
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H 0 ( 0) results in the value a >(m + j) with certainty. Therefore we must have 
tr (WP^J = 1, from which it follows that W = P^ m . What is the probability 
for the same ensemble that a measurement of H(t) will result in a meaurement 
value of c o(n + |) (that is, for a time displacement t of the registration ap¬ 
paratus for H 0 ( 0) relative to the preparation apparatus)? It is given by 

tr (WP Vnft) ) = tr (Pt m Pv nW ) = I <<p m , ^(t)>I 2 - (1-26) 

It remains to compute the factor a(t) in (1.18). From (1.17) and (1.3) it 
follows that: 

*0(0 = oiU t <t>o + <xU t (j )o = “ aU,<j) 0 + mH t U,(l) 0 
= ^'¥ 0 (t) + iH t '¥ 0 (t) 

and we obtain 

<'F o (0, *o(0> = - + i<y 0 (f), 
a. 

From H t in (1.10) it follows that 

l = - y + <%>(*), * o (0>- 


In order to compute the inner product <T 0 , * 0 > we begin with T 0 = 
v f , o) and use (1-23); we obtain: 


^o(0 = I <Pn( —f ~ <1/2)l/( ° |2 

n 


1 



By differentiating with respect to time it follows that: 

* o (0 = 1 ~ i(-/)"(// + //)}• 

n Jn\ 


Therefore we obtain 


Wo, * 0 (o >=urf - in = +/(o^ to ‘) 


2 


mco 


from which we obtain the following equation for a: 


l = - y + \ -7= (/( t)e imt + f(t)e-n. 

a L L ^2010) 


With a(0) = 1 we finally obtain 


,. . icot 

a(0 = exp? - — + 


' ~ 7 == fk(r)(f(r)e imt + f(x)e~ im ) dr). (1.27) 
1 y/2ma Jo ) 


Thus we have completely determined U t . 
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For our second example illustrating the Heisenberg picture we shall 
consider an elementary system of mass m in the absence of an external field. 
Here we have H = (1/2m) Yj= 1 Pf- For P ; (t) and 2X0 it follows from (1.1) 
that 

PM = 0, 2,(0 = - P/t), 

m 

P t is therefore constant with time, and we obtain 

2.(0 = 2i(0) +-Pi • (1.28) 

m 

Thus it follows that the expectation value for the position for the ensemble 
W is given by 


M(2,(0) = tr(fF2 ; (0) = M(2i(0)) + - M(Pi), (1.29) 

m 

that is, the expectation value of the position moves with constant velocity 
(1 /m)M(Pj). If we set 

2X0 = 2X0 - M(2X0)i, 

P'i = Pi - M(Pi) 1 , 

then, from (1.28), (1.29) it follows that: 

2X0 = 2X0) + -PJ. (1.30) 

m 

For the dispersion 

disp(2,(0) = tr(wm 2 ) (1.31) 


we obtain the relation: 

disp (2X0) = disp(2X0)) + - m(2;(0)p; + p;2X0)) 

m 

t 2 

+ — disp (P ; ). 
m 

It is easy to compute the time t m for which disp(2,(t)) is a minimum. For t m 
we obtain 


M(Q'i(t m )P'i + P'iQKtJ) = 0 


from which it follows that 

disp(2X0) = disp(2XO) + --i J - disp (Pi). (1.32) 

m 
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Equation (1.32) shows that the dispersion of 8; is quadratic in its time 
dependence relative to its minimum value. In addition to (1.32) the Heisenberg 
uncertainty relation IV (8.3.18) must be satisfied 

[disp(.P ( )] 1/2 [disp((2 ; (t)] 1/2 > ± (1.33) 


If (1.33) is an equality for t = t m then at other times the Heisenberg uncertainty 
relation can only be satisfied as an inequality. 

In addition to the expectation values, it is also possible to obtain answers 
to questions concerning the probabilities for different position and mo¬ 
mentum values. In order to make computations, we shall consider the 
special ensemble W = P v . 

The probability for a momentum value in a region n of momentum space is 
constant with time and is given by 


I <Pu Pi , P3 1 <P> I 2 dpi dp 2 dp 3 , (1.34) 

*71 

where < p u p 2 ,p 3 1 cp) is the representation of cp in the momentum representa¬ 
tion. We will now compute the probability for a measurement of the position 
Qi(t). For this purpose we shall transform from the momentum representa¬ 
tion to the (<2i(0> 8 2 (0. 83 ( 0 ) representation. In order to obtain this 
representation, we seek the solution of the equations 


Qj(f)q XuX2 ,x 3 = Xjgx 1 .x 2 .x 3 ( 1 - 35 ) 

in the momentum representation, where Q/0) is the operator — (1 /i)(d/dpj). 
Here (1.35) becomes 


1 d t 

T p-1- Pj 

i dpj m 


■)<Pl.P2. 


p 3 |x 1 ,x 2 ,x 3 > = x/p 1 ,p 2 ,p 3 \x 1 ,x 2 ,x 3 y 


from which it follows that 


<Pl>P2>P3l*l>*2>*3> 



2m 



From 


<Xi, x 2 , x 3 \(p} = J <x x , x 2 , x 3 1 p u p 2 , p 3 }{px, p 2 ,p 3 1 (p> dp x dp 2 dp 3 

we find that the probability that a measurement of 8/0 will result in a 
measurement value in the domain is given by: 

|<x x , x 2 , x 3 \(p}\ 2 dx x dx 2 dx 3 . 

Jy 

If (p is so chosen that, at time t = 0, it satisfies the Heisenberg uncertainty 
relations with an equal sign (where we may obtain an evaluation of IX (5.37) 
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which is analogous to IX (5.38) in the momentum representation) we then 
obtain 


<P 1 .P 2 .P 3 I 0 >> = exp 

and it follows that 

<x u x 2 ,x 3 \cp) 


12 \ 3/4 r 3 1 r 1 3 

= — 2 exp —i X (P* ~ ex P -~2 Y,(Pi- 
\na / L ;=1 J L a i =1 J 


1 / 2 \ 3/4 f f 3 / / \ 3 I 

-w 5 !;?) M')?, 2 ^ pl ) ~ 

- ( ji ) I <Pj - >Ij) 2 l <fPl #2 <iP3 


" (2^W b " 11 eXP [- 4^W I, h “ (» + ¥ 

.■“ |iTF I, [ 7 (jt ' - ^ - L {x ‘ - - 5S? "< 


■ exp 


• exp 


3 

i I njyj > 
. 3=1 J 


where b — 1/a 2 + it/m. 

Here it again follows that the probability distribution |<x l9 ..., \cp}\ 2 
propagates with velocity where M{p } ) = In this way we obtain 

a more precise description of the behavior of this probability distribution 
(not only the dispersion given by (1.32)). 


2 The Schrodinger Picture 

The expression \x{WB ,) is used to compute all probabilities and expectation 
values. If B, does not explicitly depend upon the time t, then from (1.2) we 
obtain the expression tr(WU t B 0 Uf). From the property of the trace func¬ 
tional we may exchange U, with WU t B 0 ;we obtain 

tr(W 0 U,B 0 U- ) = tr (U t + W 0 U t B 0 ), (2.1) 

where we have written W = W 0 in order to emphasize the fact that W = W 0 
is constant in time. We now define a time dependent W, as follows: 

w,= u;w 0 u t ( 2 . 2 ) 

we then obtain 

tr (W 0 B t ) = tr (W t B 0 ). (2.3) 

In §1 we have called the expression on the left side of the equation (2.3) the 
Heisenberg picture. We call the expression on the right-hand side the 
Schrodinger picture. For many purposes it is often mathematically more 
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convenient to work with the right-hand side of (2.3), the Schrodinger picture. 
In particular, the product representation of the Hilbert space introduced in 
VIII (2.1) for t = 0 does not change with time (see VIII, §2). The position 
representation of the Hilbert space (in which the momentum operators 
have the form (l/i)(d/dx)) can be taken to be fixed with time. 

If B t depends explicitly upon the time, that is, if SB Jot # 0 then, in the 
Schrodinger picture we may define an operator 

B t =UjB t U, (2.4) 

which, in the case that dBJdt — 0 is (according to (1.2)) equal to B 0 . Instead 
of (2.3) we find that, in general 

tr(W 0 B t ) = tr(W t B t ). (2.5) 

For the special case in which W 0 = P (po we obtain 

u?p vo uj= ujv 0 ((p 0 , ujy = uj(p 0 (uj(p 0 ,fy 
= = P<? t f , 

where 

(p,= U?(Po- (2-6) 

Taking the adjoint of equation (1.3), we obtain the following differential 
equation for cp t : 

<p t = UjU t cp 0 = UjU,(p t = —iUj H t U t (p t . 

Using definition (2.4) it follows that 

-y (p t = ft t (p t . (2.7a) 


In the same way we obtain the following differential equation for W t 


dW t 

dt 


= i(W t H t - H, W t ). 


(2.7b) 


(2.7a) and (2.7b) are called the Schrodinger equations. 

What is the form of the operator 5,? 

As we already mentioned in VIII, §6, in the expression VIII (5.8) for H t we 
are to use the operators P^t), Qi(t), SJt). Since PJt) = U t Pi(0)Uj, etc., 
we find that we obtain the operator H, if we replace all P ; , Q t , S t in VIII (5.8) 
by PM 6,(0), S,(0), that is, if all P t , Q h S t are chosen to be constant with 
time. Nevertheless there can be a remaining time dependence in fl t which 
may be due to the time dependence of the external field A, cp, that is, through 
an “explicit” time dependence of H t . 

In order to clarify this situation, let us consider an elementary system of 
charge e in an external time dependent field. Then we would have 

= LPi(t) - eAJQM Q 2 (t), QM t)] 2 

Zm i i = 1 

+ M 2 i( 0 , 62 (f), 63 (f), f). ( 2 . 8 ) 
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From (2.8) it follows that 

= in 0) - ^(<2i(0), G 2 (o), <2 3 (0), r)] 2 

2m i=1 

+ etpiQM, G 2 (0), G 3 (0), 0. (2.9) 

If we know that we are in the Schrodinger picture, we may omit the symbol 
(0) in (2.9) and write 

Bt = 2 l m^ lPi - eAi(Ql ’ e2 ’^>0] 2 

+ ecp(Q 1 , 0,2, Q 3 , 0 . ( 2 . 10 ) 


where we know that the P h Q t are operators which are constant with time. 

With (2.10) the Schrodinger equation (2.7a), as expressed in the position 
representation corresponding to Q u Q 2 , Q 3 is given by 


it>, I . 1 v 


~~-eA k ( Xl ,x 2 ,x 3 ,t) 

l CJX'fc 


+ e(p(x 1 ,x 2 ,x 3 ,t){x 1 ,x 2 ,x 3 \cp t ). 


(x 1 ,x 2 ,x 3 \(p t ) 

( 2 . 11 ) 


In Schrodinger’s heuristic development of quantum mechanics equation 
(2.11) is “guessed” with remarkable intuition. From (2.11) it follows that a 
wave packet in a “weakly” varying electromagnetic field behave much like 
mass points of classical mechanics with the Hamiltonian 

1 3 

H = — £ (P k - eA k (q u q 2 ,q 3 , t)) 2 + e(p(q u q 2 ,q 3 , t ) 
zm k= ! 


(see, for example, [6], XI, §1.2). 

If H t does not explicitly depend upon time, then in both the Heisenberg 
and Schrodinger pictures H t is a time-constant operator: H t = H t = H 0 . 
The Schrodinger equation (2.7a) is then given by: 

~~(p, = Hcp t , where H = H 0 . (2.12) 

Let s denote the values of the spectrum of H and let a denote the degeneracy 
index for H. In the //-representation (2.12) becomes 

1 d 

- ~ — <£,<x\(p t ) = e(e,a.\(p t y (2.13) 

i at 

the solution of which is given by 

<e, a|<p,> = e~‘ s '<£. a|<p 0 >- (2.14) 


For examples we shall again consider the case of a harmonic oscillator and 
a free elementary system. 
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Let U t denote the unitary operator for the harmonic oscillator which was 
computed in §1. The solution of the Schrodinger equation (2.7a) is given by 

<p t = u;<p 0 

and, in the N — A 4 (0)/l (O)-representation we obtain 
<»l <Pt> = J]<»l^ + |m><mko> 

m 

= E <™\U,\n)(m\(p 0 ). 

m 

According to (1.18) 

<4>m, U,<t>n> = «(0<> | L, | «> 

and we find that 

<m\U t \n> = a(t)<(/> m ,'P„>. 

Therefore we obtain 

<n\(p t > = “(0 E &»><»» I <Po> 

m 

= «(0E <<£*»><* I Po>, (2-15) 

m 

where T„) is given by (1.25a), (1.25b) and a(t) is given by (1.27). 

Instead of taking a somewhat circuitous route using the Heisenberg 
picture, we shall instead directly solve the Schrodinger equation (2.7a) in 
the position representation where the latter is given by 

2 

171(0 

+ -~^~x 2 (x\ q> t ) - k(t)x(x\(p t y. (2.16) 

For /(f) as defined in (1.14) we shall set 

y(t) = —/== (e- im f(t) + e i0>, mx 

^/2moo 

n(t) = - fit))- (2-17) 

i V 2 

y(t) and n(t) are solutions for the motion x(t), p(t) for a mass point according 
to classical mechanics! Thus we find that solutions of (2.16) are given by 


9n(x, t ) = 



(mco) 1,2 H n ((mco) 1,2 (x — y(t))e 


(m<a/2)(x-y(!)) 2 


. e in(t)x £ - i Sq (it(t)/2m) dz £ - ico(n + l/2)t 


( 2 . 18 ) 
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where 


H n ((mco) 1,2 x)e 


-(mco/ 2)x 2 


are the normalized eigenfunctions of 

-Lpl + —Q2 
2 m 2 K 

which correspond to the eigenvalues (n + £)c o (see IX (5.30)), that is, 


mco 


2 2 n! 


H„((mco) 1/2 x)e- (ma2/2>x2 = <x|n>. 


(2.19) 


From (2.18) it therefore follows that g„(x, 0) = <x|n>. The most general 
solution of the Schrodinger equation (2.16) is therefore given by 


<x\cp t > = Ydn(x,tKn\cp 0 ). 

n = 0 

Clearly (2.15) and (2.20) must be the same solution. Since 
<x\(Pt> = E <x|n><n|()9 ( > 

n 

and 

/•OO 

<n\(p t y = <n|x> <x|<p,> dx 

* — 00 

from (2.20) it follows that 


<n\(p t y 


m = 0 


( 2 . 20 ) 


/»00 

(n\xyg m (x,t)dx <m|<p 0 >- (2.21) 

y —oo 


Since the <m|<p 0 > were arbitrary, by comparison of (2.21) with (2.15) we 
obtain 


/•OO 

ottKVJt), cl> m y = (n\xyg m (x, t) dx. (2.22) 

J - 00 

We now seek to answer the same physical question as we have in §1. Suppose 
that we prepare an ensemble for which the measurement of the observable 
H 0 (which is constant in time in the Schrodinger picture!) has, at time 0, the 
eigenvalue (m + %)a> with certainty. Therefore we must have W 0 = P,„ a 
and <«|<p 0 > = <5„ m . If, for the same ensemble we measure the observable 
H 0 “at time f” we must use W, = P <Pt . For the initial value (n\q> o y = S nm 
from (2.21) it follows that 

/•OO 

<«!<?,> = <n\xyg m (x, t)dx. (2.23) 
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The probability that the measurement value will be co (n + is given by 
| <n| cp t ) | 2 in complete agreement with (1.26). 

For a free elementary system the Schrodinger equation, as expressed in the 
momentum representation, is given by 

13 1 3 

- -T yAPi>P2’P3\ ( P,> = 7T~ £ VKV l,P2,P3\<Pt>- 
i at 2m i=1 

The general solution is given by 

<Pl.P2.P3l?.> = eX p[ _ ^ <Pl.P 2 .P 3 l?» 0 >- 

The solution of the Schrodinger equation (2.12) in the position representation 


i at 




is therefore given by 

<x 1 ,x 2 ,x 3 |<p ( > = j (x 2 , x 2> x 3 \p u p 2 , p 3 } (p x , p 2 , p 3 \(p t } dp u dp 2 ,dp 3 


• <Pi» Pi, P 3 l<Po> dp I, dp 2 , dp 3 . 


where 


<Pi»P2.P3l<Po> = J <Pu Pi, P 3 \x u x 2 , x 3 } (x v x 2 , x 3 \(p 0 ) dx x dx 2 dx 3 . 
If we let 

3/4 r 2^ [(Xj - yj) 2 i , , 

exp|^-fl £ [ 4 - ^2 *A X J “ Py) 

then from <x 2 , x 2 , x 3 1 (p t ) we obtain the same physical results for the time 
behavior for the position probability distribution as we obtained for the 
Heisenberg picture at the end of §1. 



<x u x 2 ,x 3 \(p 0 } 


- ( a2 \ 
\2nJ 


3 The Interaction Picture 

Frequently we find that it is possible to separate the Hamiltonian operator 
into two components H = H° + H 12 , where the spectral representation of 
H° is easy to determine. In applications H° frequently has the following 
form: H° = H° x 1 + 1 x H 2 , which refers to the product decomposition 
x Jf 2 “at time t = 0” for a composite system containing a system 
of type 1 and of type 2. JT°, H 2 are the Hamiltonians for systems of type 1 
and type 2, respectively; see VIII (2.1). H 12 is often called the interaction 
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Hamiltonian, from which we obtain the expression “Interaction Picture.” 
Here we assume that H° does not explicitly depend upon the time. Thus, 
(according to §2) H° is a time-independent operator. Thus it follows that 

e iB°t = e iSo t x e mo t (3.1) 

Thus it follows that 

e iS °‘(B 1 x B 2 )e~ iSo ' = (e^B.e-^) x (e iR ° l B 2 e ir, ° l ). (3.2) 

Equation (3.2) means that the product representation 1 x Jf 2 

remains invariant under unitary transformations of the form (3.1), a result 
which is very practical (see XVI). 

With W t defined by (2.2) it follows that, for the expectation value of an 
observable in (2.5) 

tr(^B ; ) = tr (e iSOt W t e- iaOt e iS0t B t e- iS0 ‘) 

= tr(F t + W 0 iy fio, B t e- ifio, ) ) 

where 

V, = U t e~ iSot . 

If we simplify our notation as follows: 

B, = e il1ot B t e- iito ‘ and W, = V t + W 0 V, 
then the expectation values can be written in the form 

tr(W,B ( ). 

In (3.6) the operator W, which describes the ensemble and the operator B, 
are time dependent. The time dependence in (3.6) is therefore divided among 
both factors as follows: B t depends upon the time t as if H° was the 
Hamiltonian operator, that is, in the case (3.1) as if there was no interaction 
between the subsystems. B ( therefore corresponds to the Heisenberg picture 
without interaction. 

The time dependence of W, arises therefore from the interaction. Similarly, 
as in the Schrodinger picture, for the special case W = P ll/Q it follows from 
(3.5) that: 

W, = P*, where & = V^ 0 = e iSOt U^ 0 . (3.7) 

From differentiating with respect to t and using (1.3) it follows that 

= -H°4>, + 

= -By, + e iBo, U*H t U t e- iSl V,- 

We find that 

l/ t + H t U t = H t = H° + ft] 2 . 


(3.3) 

(3.4) 

(3.5) 

(3.6) 
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Since H° commutes with e iSo ‘, we therefore finally obtain 

— t <A( = H 1 Vt> (3-8) 

where 

H t 12 = e i&0t Hl 2 e- iS0 ‘ (3.9) 

represents the operator H 12 in the time dependence of the Heisenberg 
picture without interaction (see (3.5)). 

As in §§1 and 2 we shall again consider the example of forced oscillation 
of a harmonic oscillator with 

H ° = i pl + e2 ’ Hl2 = ~ e/c(0 ' (3 - 10) 

For this case equation (3.8) becomes 

-j&= -wom,, (3.1D 

where Q(t) is to be computed according to the equation of motion of a free 
oscillator 


0(0 = - P(0» P(0 = —mo) 2 Q(t), 

m 


that is, 


Q(0 = 6(0) cos cat H—- P(0) sin cot. 

mco 


(3.12) 


For the 6(0)-representation of the Hilbert space, in which P(0) = (1 /i)(d/dx), 
(3.11) is transformed into 


-|<x|^) = fe(t) 

i ut 


1 . d 

x cos cot H -sin cot — 

imco ox 


<xW,>. (3.13) 

For a special solution for this equation we choose the form 


g(x, t ) = +«<*>]. 


(3-14) 


For (3.13), with this choice, we obtain 


(xk + it) — k(t)x cos cot -I- k(t)ic(t) sin cot, 

mco 


that is, 


k = fc(t) cos cot, 


ti = — k(t)K(t) sin cot 
mco 
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and we therefore obtain 

m im - Mi 

t i (3-15) 

u(t) = -/c(t)k(t) sin an dr, 

J 0 mat 

where /(f) is defined according to (1.14). In order to find a general solution 
of (3.13) we observe that 

<*#t> = Z_(x - w(t))g(x, f), 

where 


w(t) 


2ma> 


(/(t)+/(t)) 


is, for arbitrary y{z), a solution of (3.13) providing that g(x, t) is a solution. 
Thus, for the most general solution we obtain: 

<x|tA ( > = -fix - w(t))e i[XK<()+u(1)1 . (3.16) 

For f — 0 we obtain w(0) = 0, k( 0) = 0, u(0) = 0. y(x) is therefore equal to 
<x | \jj 0 y so that we obtain: 


<x|i p t y = <x - w(t)\xl/ 0 )e ilxm+m] . (3.17) 


Since the energy observable H° = (1/2 m)P 2 + ( ma> 2 /2)Q 2 is constant with 
time in the interaction picture, then so are the eigenvectors <x|n> (in the 
(/((/-representation). The probability that the energy value a>(n + l) is 
measured at time t, assuming that the ensemble is, at time t = 0, dispersion 
free with respect to the measurement value a>(m + l) is given by 


f (n\x}(x — w(t)\m}e llXKit)+mi dx . 
* — 00 


(3.18) 


which, according to (1.25a, b), (1.26) agrees with | </ m , *?„> | 2 and, according 
to (2.23) agrees with | <n | <p,> | 2 . 


4 Time Reversal Transformations 

In VII, §7 we have considered parity transformations and have discussed the 
problem that it is nontrivial to realize such a transformation by means of 
r-automorphisms. The problem of the realization of time reversal trans¬ 
formations is even more difficult to comprehend. 

The time reversal transformations correspond to the time “reflection” 
x' v = x v , t' = —t. What is the physical meaning of such a transformation? 
The transformation r x in VII (7.1) can be intuitively visualized with the aid 
of a mirror, which gives us some idea on how we may build a “mirror image” 
of a registration apparatus. What, however, do we mean by a registration 
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apparatus for which the time is reversed? Here we may consider the idea of 
making a motion picture of experiments with microsystems, and running 
the film backwards. By analogy with a mirror image, we may seek to realize 
such an experiment by running a motion picture backwards, but it is clear 
that this is impossible because the experiments are “irreversible.” We 
cannot realize the following process in reverse: microsystems are emitted 
by a preparation apparatus, and are detected by a counter (as a registration 
apparatus) which, in turn, responds. The response of the counter is not 
reversible, and cannot be realized in reverse. In general, the time direction 
of the action of the preparation apparatus on the registration apparatus by 
means of microsystems is that direction in which entropy increases in the 
macroprocesses connected with the preparation and registration. This 
directedness is an important fundamental structure underlying quantum 
mechanics which cannot be avoided—see, for example, [6], XVI and [7]. 

Therefore we shall not, at first, discuss the question whether there exists a 
realization of time reversal by means of registration procedures. Instead, 
we shall define a time reversal transformation T as a B-continuous effect 
automorphism for which we obtain a representation of the “extended Galileo 
group” by means of effect automorphisms, where the Galileo group is 
extended by the addition of the time reversal transformation 

s: x' v = x v , t' = -t. 

With the help of VII (1.1) it follows that 

(A, 3. rj, y)s = s(A, -5, rj, -y). (4.1) 

With the reflection operator r in VII, §7 it follows that 

rs = sr. (4.2) 

The operator T can, in principle, transform one system type into another, 
a fact of great importance in elementary particle physics. Here we shall assume 
(in analogy to the case of the spatial reflection (parity) operator) that T 
leaves system types invariant. According to V, §5, in the Hilbert space 
of every system type there exists a unitary or anti-unitary operator V which 
satisfies 

Ty=VyV~ 1 (4.3) 

for yeFrom s 2 = e it follows that T 2 y = y and that V 2 = e ta l. 

From (4.1) and (4.2) it follows that 

U(A, 3, n, y)VU(A, -3, rj, -y) = A(A, 3, rj, y)V, (4.4) 

U(r)VU(ry l = e ifl V. (4.5) 

In Hilbert space jf" where x t s we define the following anti¬ 

unitary operator C as follows: Let u+,u_ denote the basis vectors of r s 
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which were defined in VII, §3. The vectors of " may be written in the follow¬ 
ing form (where Q t is defined as multiplication by r ; , P ; is the operator 
1 A' grad ?): 

9 • • • 9 I (p 9 <X l9 • • • 5 i^(K2 

«l» —.an 

where each a ( runs through the indices + and —. We set 

c E <Ti> r 2 , ■■■,?„|tpiaj. a n ) u Xl ■ ■ ■ u Xri 

ai .<x„ (4.6) 

= E <J u f 2 ,...,r n \(p-,a. x .a„>(ai, a 2 ,..., a>_ ai u_ a2 • • • u_ a>> , 

«l» -.an 

where (a l5 a 2 ,..., a„) = 1 for an even number of a ; = — 1 and = — 1 for 
an odd number. C then transforms {Jf n } + and {^"}_ into themselves. 
It is easy to extend the definition of C to the more general case of a composite 
system consisting of n electrons and several atomic nuclei; here it would be 
necessary to introduce additional position coordinates and additional spin 
vectors. 

It follows that C 2 = (-1)"1 and C~ 1 = ( —1 )"C. Here it is easy to prove 
that 

CQi ~ QiC, 

CP ; = -P ; C, (4.7) 

CSi — —SiC, 

CH = HC, 

where H is the Hamiltonian operator (without external fields) given by 
VIII (5.8). 

For the reflection operator defined in VII (7.4) for Jf” = #C b x 4 S , U(r), 
defined in (4.5), as an operator in is given by U(r) = (R x 1) x (R x 1) x 
• • • x (R x 1). From (4.6) it follows that 

CU(r) = U(r)C. (4.8) 

From (4.7) it follows that 

C U(A, 6, rj, y) = U(A , - S, rj, - y)C. (4.9) 

If we multiply (4.4) and (4.5) on the right by C~ 1 , we obtain 

U(A, l ij, y)(FC)- l )U(A, 6, ij, y)~ 1 = A(A, S, ij, y)VC~ \ (4.10) 

UirXVC-'Wiry 1 = e if> VC~\ (4.11) 

From (4.10) it follows that the A(A, 5, ij, y) form a one-dimensional repre¬ 
sentation of the Galileo group; therefore !(•••) = 1. For an elementary system 
it therefore follows that VC -1 must be a multiple of the unit operator, and 
since a number in V is arbitrary, we may set 

V=C. (4.12) 
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We shall require that (4.12) also holds for the case of a composite system. We 
therefore find that, from (4.3) 

Ty = CyC ~~ 1 . (4.13) 

For the special case in which y is an element of L, that is, y is an effect, then 
there exists a B-continuous effect automorphism defined by (4.13). Note 
that this result does not, in any way, explain how the transformation (4.13) is 
to be realized in terms of registration procedures. The following question 
we cannot answer: Let (b 0 , b) be an effect procedure; for which other effect 
procedures ( b ' 0 , b') is the equation C\jj{b 0 , b)C~ 1 = \]/(b' 0 , b r ) satisfied? 

By analogy with the transformation from the Heisenberg to the Schrodinger 
picture, we may write 

tr (WCBC- l ) = tr(C~ 1 WCB). (4.14) 

We may therefore apply the time reversal transformation to the ensemble W; 
we obtain 


W-tC-'WC = CWC~\ (4.15) 

where we note that C ~ 1 = ( —1)"C. Again, we have not explained how, 
given a preparation procedure a, we may obtain another a ' such that 
cp(a') = C<p(a)C _1 . 

According to (4.9) we obtain the special case 

Ce iHy = e ~ iHy C. 

In the Schrodinger picture we find that 

W t .= e~ iHit - r) W t e mt '' t) . 

Let us define 

W' t , = CW 2l _,,C~ i 

then, from (4.17) it follows that 

W\. = Ce iH(t ~ x) W z e~ iH(l ~~ T) C~ 1 
= e~ iH( ‘~ t) CW z C~ 1 e iH{t ~ z) 

_ e -iH(t'~x)w e imt'-T)' 

Therefore W',' is also an ensemble which satisfies the Schrodinger picture. 

If we subject W t to normal time development from time t 1 tot 2 = T>t 1 
then at the time t, W x . is transformed into C W t C ~ 1 ; if we then subject CW t C~ 1 
to normal time development of r to the same interval t 2 — t x = z — t x 
until t + (r — t x ) = 2r — t u then, according to (4.19) we obtain at time 
2 t — t 1 the ensemble CW tl C~ 1 , that is, the ensemble which is time reversed 
with respect to the “initial ensemble” W tl . This means that we can run the 
“quantum mechanical Schrodinger motion picture” W, backwards in the 
form CW 2t ^,C ~ 1 , that is, this “backwards” version is a physical possibility if 
the formulation of quantum mechanics described here is a g-G-closed 
theory in the sense of [8], §§8 and 10. 


(4.16) 

(4.17) 

(4.18) 

(4.19) 
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It is therefore an important question whether for a given preparation 
procedure a, it is possible to produce an a' (at least, in principle) such that, 

cp(a') x C,(p(a)C~ 1 

is satisfied (at least approximately). Clearly, for large systems, in particular, 
macrosystems, this is not the case because of their irreversible nature; see, 
for example, [6], XV, [7], X and the remarks in Chapter XVIII. A macro¬ 
scopic extrapolation of quantum mechanics cannot therefore be a g-G- 
closed theory for macrosystems than such an extrapolation of quantum 
mechanics (see [7], X). 

In XVI, §7 we shall return to a discussion of the question whether the 
“Quantum Mechanics of Microsystems” can be a gr-G-closed theory. 
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CHAPTER XI 


The Spectrum of One-Electron Systems 


In IV, §4 we suggested that, in order to obtain a more concrete and en¬ 
compassing development of the theory, we need to establish the connection 
between well-defined (with respect to their technical structure) registration 
methods b 0 , the corresponding Boolean ring (%(b 0 ) and the operators 
g = i l/(b 0 , b) by introducing axioms which specify the mathematical form 
for the operators g = \j/(b 0 , b). 

We shall not number these axioms; instead we only note that they are 
additional axioms in the sense of the mathematical picture. We also cannot 
give a comprehensive description of the technical details of construction 
in order to formulate these new mapping principles. We are only able to 
obtain an overview of the preparation and registration procedures by using 
the concise language of the physicist in which previous knowledge of the 
physical phenomena associated with the structure of the apparatus is 
already assumed. In accord with this mode of expression we shall, for example, 
speak of a “spectral apparatus” without making reference to textbooks of 
experimental physics concerning the structure of such an apparatus, or its 
resolution capability. In XVII we shall present the first step in the develop¬ 
ment of an extension of the theory which will provide a more precise descrip¬ 
tion of the structure of the preparation and registration apparatus and we 
shall ascertain the form of the maps J' K, S' L for some examples. In 
XVIII we shall examine the extension problem in more detail without 
attempting to obtain a solution to the problem. A solution of this problem 
can be found in [7], XI. For that reason we must, at least for the present, be 
satisfied with the prospect of having to guess or discover individual axioms 
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for particular preparation and registration methods. 

It is advisable to pursue this route by beginning with special cases (one- 
electron spectra) rather than the general case (for example, the problem of 
atomic and molecular spectra). It is easier to understand how the special case 
leads to the general case rather than to understand an overview of the general 
case. For this reason we urge readers to be patient in their consideration 
of the systematics when they encounter a previously introduced axiom at a 
later point in the development as a special case of a later axiom. In this sense 
the previous comments may serve as a justification for the following dis¬ 
cussion of the light emission for the case of a single electron. This type of 
presentation permits us to show the essentials more concisely and the reader 
may eventually be able to carry out the necessary generalizations. 


1 The Effect of the Emission of a Photon 

For the present we shall not attempt to develop a systematic theory in order 
to establish the correspondence described earlier between an operator g and 
the construction rules for the registration apparatus. Here we shall often use 
partially known theories, and other theories (obtained by guesswork) in 
order to obtain an expression for the desired operator. We will then introduce 
the latter as an “axiom” in our theory. 

As is well known (see, for example, [6], XI, §1), the description of the 
emission of photons has been very successful. The so-called “spectral 
apparatus” is well suited to determine the wavelength of a photon emitted by 
an atom, and to register the photon on a “detector” (for example, a photo¬ 
graphic plate, photomultiplier, etc.). In terms of the discussions presented 
in II an atom produced by a preparation apparatus a produces a detection 
response b (with the aid of a photon) in the registration apparatus b 0 , where 
b 0 is a special spectral apparatus. Normally, a spectral apparatus is an 
integrating apparatus. It does not register each individual photon, but 
instead records the intensity of the spectral lines, that is, it approximately 
measures the number of photons which are confined to the region of a spectral 
line. 

Which operator g should correspond to the following effect: Within a 
time interval At a light quantum (photon) of frequency co is registered by the 
spectral apparatus? Here we shall not inquire about the details of the spectral 
apparatus—such as the width of the slit, the spatial orientation of the slit, etc.; 
instead, we will only inquire about the effect, that “an atom emits a photon 
of frequency co in the direction of solid angle dCl during the time interval 
At.” In this way we may obtain, with the help of numerical factors, the 
“intensity” of a spectral line measured by a concrete apparatus. 

This example demonstrates typical aspects of the theory of many registra¬ 
tion apparatuses: First a process is treated in a purely quantum mechanical 
way (for example, the emission of a photon by an atom). The next step in the 
process is then treated classically (for example, the propagation of the 
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photon in the spectral apparatus is treated in terms of wave optics). In the 
third step we encounter the problem of registration (here, let us consider a 
photographic plate) where we need to know either from the theory or from 
experiment whether all or only a known fraction of systems subjected to the 
preceding processes are actually registered. Here we shall consider only 
the first part—the emission of a photon by an atom; we shall assume that the 
remaining theory of the spectral apparatus is already known. 

At present we are unable to describe the emission process for a photon 
purely from quantum mechanics, because it would first be necessary to 
develop a quantum mechanical theory for the electromagnetic field. Because 
there is no mathematically correct formulation of such a quantum field theory 
at present, we have to rely on a very extensive workable and practical system 
of computation schemes (the so-called quantum electrodynamics) which 
functions well for most problems. We will not use these methods to compute 
the desired effect of the emission of a photon; instead, we shall give a some¬ 
what simple “rule of computation” for this problem, from which we may 
obtain the desired operator. 

Our starting point will be the classical description of the emission of 
electromagnetic radiation by means of a charge-current distribution, as 
computed using Maxwell’s equations. We will first describe the charge- 
current distributions classically by means of two functions p(x, t ) and ](x, t ). 
We shall then introduce operators for the classical expressions for the 
charge-current distribution for a single electron around an atomic nucleus. 
These operators will be inferred from the use of the wave-picture for the 
electron. Since, in the following computations p and j will occur linearly, the 
computations for the corresponding operators p, j will be completely 
analogous to those obtained for the classical functions p and j. 

The fields generated by p and j are computed from the scalar potential 
cp and vector potential A by means of the equations 

E = —A — grad cp, B — curl A. 

For cp and A we use the formulae for retarded potentials: 



p(x',t - 15c - 5c'|) j3r ., 
\x — X I 

j(x’,t - 15c — 5c'I 


( 1 . 1 ) 

( 1 . 2 ) 


We shall consider these retarded potentials for distances which are much 
greater than the region of the charge and current distribution, where the latter 
is characterized by a distance d which roughly corresponds to the “diameter” 
of the atom. In this section we shall obtain a quantitative value for d from 
quantum mechanics. For the origin of our coordinate system we will choose 
the center of mass of the atom; for all practical purposes, since the mass of 
the electron can almost be ignored compared to the mass of the nucleus, the 
center of mass coincides with the nucleus (see VIII, §§2,4,5). In the transition 
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to quantum mechanics we retain the convention that the position operators 
(after separating out the motion of the center of mass) describe the position 
relative to center of mass (see, for example, VIII, §2). Here, for simplicity, we 
assume that the velocity of the center of mass is so small that the Doppler 
frequency shift of the spectral line—the so-called Doppler broadening—can 
be neglected. Our discussions can easily be extended in order to take into 
account the effect of the motion of the center of mass, that is, for the case in 
which an ensemble W has an appreciable probability for large center of mass 
motion. 

We will now consider an expansion of (p and A according to the following 
two viewpoints: 


(1) The point x lies far outside the sources of the field, that is, [ 3c | $> d. 

(2) The wavelength A of the emitted radiation is much greater than the 
size of the atom, that is, A $> d. 

We shall now introduce an expression of <p and A in powers of d/\x\ and 
d/X as follows: 

First, it follows that (since 15c' |/| x \ < d/\ x |) 


— pry ( 1 + 

be 


X ■ X 


+ 


(1.3) 


We shall obtain an expansion of p and j in terms of the retardation in 
t — \ x — x' |. A time derivative of p corresponds to (if we consider p expanded 
in a time Fourier series) a power of the frequency co, which is proportional 
to 1/A. Thus, in the following, we can read, from which power in d/X the 
individual terms of the expansion arise. We then obtain (where the position 
dependence of p is supressed): 


p(t - 15c - x'|) = p(t - |x|) + pit - |x|)e + $p(t - |x|)e 2 + •••, 


where e = |x| — |x — x'| = x' • x/|x| + • • •. If we carry out the expansion 
only to terms of first order in x'/|x| (that is, d/\x \ and d/X) we obtain 


p(t 


x — x 


— Ixl) xx' X 5c' 

+ + (1 - 4) 


The so-called dipole moment is given by 


dit - 



x'pix', t — 15c |) d 3 x' 


(1.5) 


and the charge (which is time independent) is given by 



p(x', t — |x|) d 3 x'. 


We therefore obtain 


= Ji + W' 3( ‘ ~ |S|) + ik ~ |iD ' 


(1.6) 





1 The Effect of the Emission of a Photon 65 


For the vector potential A we obtain an analogous expression, where p is 
replaced by j. In the analogous expression for A the individual terms can be 
rewritten in such a way that their physical interpretation will be more clearly 
evident. Using the continuity equation p + divj = 0 and, integrating by 
parts, (here the surface integrals vanish) we obtain: 



t — |x| )d 3 x’ = 


J j ■ V'5c' d 3 x' = — J3c' div ) d 3 x' 
J px! d 3 x' = d(t — \x |), 


where V' is the vector operator (d/dx\, d/dx' 2 , d/dx' 3 ). 
For an arbitrary constant vector a we obtain 


Ja ■ xj d 3 x' = Ja • x’ j ■ V'x' d 3 x’ — — JxV' ■{ja-x') d 3 x' 
= — Jic'a • x! di v]d 3 x' - J x'j ■ V'(a • x')d 3 x’ 
= Jx'a • x'p d 3 x' — Jx'j-a d 3 x'. 


We may neglect the term jx'a • x'p d 3 5c' because it is of order of magnitude 
d/\ x | d/k in the expansion. We may therefore set 


J a • x'j d 3 x' = \ j*a • x'j d 3 x’ — j jx'j - a d 3 x' 

= —5 Ja x (3c' x j ) d 3 x'. 

Similarly, we obtain 

Ja-3c~d 3 5c' = ja-xjd 3 x' = Ja x (3c' x])d 3 x'. 


We define the magnetic dipole moment 

m — \ J5c' x j d 3 x'. (1.7) 

Thus we obtain 

A(x, t) = -ir d - t ^3 x m - x m. (1.8) 

|x| |x | 3 |x | 2 

From (1.6) and (1.8) we may compute the fields E and B. 

When we make the transition to quantum mechanics we will replace p and 
j by operators. In order to discover these operators we shall make use of 
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well-known expressions for p and j from the wave picture (see, for example, 
[ 6 ], XI (1.3.4), (1.3.5))(without external fields!). 

P = <#| 2 

grad i/^ + j grad , 

where we assume that 


e 




d 3 x' = 1 . 


For the “position representation at time =0” (that is, for the representation 
corresponding to ( 2 i( 0 ), Q 2 ( 0 ), 63 ( 0 )) we choose the following operators: 


p(x', 0) = eS(x — 5c'), 


j(x\ 0) = 


2m 


S(x — x') I - gradj'j + - grad- 8(x — x') 
i 1 


where these are to be applied to functions of x. These expressions correspond 
to the description of the electron as a point charge. These operators should 
only be considered in a symbolic sense, because only the integrals of these 
expressions over x! using test functions have any meaning. The operators 
p(x\ t) and j(x’, t ) are obtained from the time displacement operator e iHt 
where H is the Hamiltonian operator for the “inner motion,” that is, after 
separating out the motion of the center of mass. In symbolic form we may 
write 


p(x, t ) = eS(Q(t) - 5c'), 


M t) = ~ [<5(6(0 - x )P(t) + Pmm - *')], 

where Q(t) is the position operator at time t and P(t) is the momentum 
operator at time t. 

From (1.5) it follows that 

d(t - \x\) = eQ(t - \x\) (1.9) 

a result that we could easily have guessed. 

From (1.7) it follows that 

m = (e/4m)(Q x P - P x Q) = (e/2m)(Q x P) 

because, in the cross product Jhe noncommuting components of Q and P do 
not appear. Since L = Q x P is the angular momentum about the nucleus 
(see VIII, §2) we obtain 

m(t — 1 5c |) = (e/2 m)L(t — |5c|). (1-10) 

To (1.10) we must add the eigenmoment of the electron which depends on 
the spin as an additional magnetic moment; this topic will be discussed later 
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(see XIV, §6) and has already been discussed in the context of interactions 
with an external field B in VIII (5.8). 

With the operators (1.9) and (1.10) E and B then become operators, that is, 
we have undergone a transition to a quantum electrodynamic description 
of emission of radiation from an atom. As in the classical case we obtain only 
an approximation if we substitute the “free motion” of the atom for d(t ) and 
m(f) and neglect the radiation reaction of the electromagnetic field upon the 
atom (for this approximation, see VIII, §5). The small width of a spectral line 
is evidence that this approximation is very good, otherwise, the emission 
would strongly affect the time dependence and broaden the spectral line. 

First we shall consider electric dipole radiation. The corresponding 
quantitities will be identified with the index (1). In the fields E and B we shall 
neglect all terms which decrease more rapidly than l/|3c|; in the so-called 
radiation zone we obtain: 


£<1) 


1 X 


PC pc 



( 1 . 11 ) 


B and E are therefore orthogonal, and both are orthogonal to the direction 

5c/|x|. 

We may now consider the use of a spectral apparatus to decompose the 
fields into individual frequencies. The frequencies with which a given 
operator changes are given by 

A(t) = e iHt A(0)e~ iHt = £ E n A(0)E m e i ^- £ " )t + ■■■, 

n, m 


where the E m are projections onto the eigenspaces of H with eigenvalues e„ 
and the remaining terms (+•••) are terms from the continuous spectrum. 

We therefore obtain the component of A(t) which oscillates with frequency 
co nm = £„ — e m as follows: 


A nm (t ) = E n A( 0)E m e Ua - t + E m A(Q)E n e~ (1.12) 

If we substitute the corresponding term d nm for d into (1.11), we then obtain 
the corresponding field intensities EJJ and B ( n l J. 

The Poynting energy-flux vector S is given by (l/47r) E x B and we there¬ 
fore obtain 



J_ 1 

An \x | 2 




x 

\x\‘ 


(1.13) 


In order to avoid misunderstanding, we emphasize the fact that 5c is a param¬ 
eter which has nothing to do with the operator d: x is, for example, the 
position of the slit in the spectral apparatus, through which, with the help 
of (1.13) the energy stream is computed, while d is the operator which 
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describes the motion of the electron relative to the nucleus!. The direction 
e = x/\x\ from the radiating atoms to the slit of the spectral apparatus 
is therefore associated with the apparatus. 

Let AO denote an element of solid angle; the energy passing through a 
surface area | x\ 2 AO per unit time in a beam of radiation is given by 

^|exd„J 2 AO. (1-14) 

If, for d nm we substitute an expression of the form (1.12), then, since 
E n E m = E m E„ = 0 for m # n, we obtain: 

4 

°^(ex E n d(0)EJ-(e x E m d(Q)E„)AQ 

4 

+ ^ (c X Ej(0)E n ) ■ (e X E n d(0)EJ AO. 

The second term differs from the first only by the series of factors. For the 
translation of the classical formula according to the correspondence principle 
there is an uncertainty. Here the physical consideration is that energy can 
only be emitted when there is an accompanying energy loss, that is, if the 
atom (for e„ > e m ) makes a transition from state n into state m. That is, there 
is no radiation with frequency co nm if the atom is in state m. This is, however, 
the case only if the second term is written in the same order(!) as the first. 
Thus, for e„ > e m the energy emitted in the direction e into the solid angle 
AO per unit time is given by: 

4 

AH (e x E n d(0)EJ ■ (e x E m d(0)E n ). (1.15) 

If we divide this expression by the energy co„ m of a photon, we find that the 
“effect” that, in the time interval At a photon of frequency cu„ m is emitted in 
direction e into the solid angle AO is given by 

A (2) F„^ m = At AO(I x E n d(0)EJ(e x E m d(Q)E„). (1.16) 

By integrating over solid angle, we then find that the effect that, in the time 
interval At, a photon of frequency co„ m is emitted is given by 

A U) F„-. m = At f^EJ(0)E m • d(0)E n , (1.17) 

where we note that j sin 2 6 d£l = 8ti/3. 

Given an ensemble corresponding to the operator We K, the probability 
for the “effect” A ( „ 2 2 m F or A^ m F is given by tr(!F(A.F„_ m )). Effects for other 
frequencies co which are not transition frequencies do not occur, that is, are 
zero! 
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The formulas (1.16), (1.17) therefore describe the fact that the frequencies 
of the spectral lines are to be computed according to the “Bohr frequency 
condition” a) nm = e„ — e m . Furthermore, with the help of (1.17) we may 
confirm the fact that the effect A (1) F„_ m (and therefore also A ,2) F„ _ >m ) are, for 
different m and n, coexistent (IV, §1.2), a fact which is necessary for the 
“combined registration” of different spectral lines by the same spectral 
apparatus (registration method). In order to show that this is the case we 
shall now estimate 

A (1) F„^ m < Atf (o 3 nm E n ri nm , 
where rj nm is the largest eigenvalue of the operator 

E n mE m .d(0)E n . 

By means of the expression 

4 

1 = T Z 

3 m 

(£n < £m) 

a time z„ is determined for which t„~ 1 is an order of magnitude estimate for 
the width of the spectral line. The time r„ is called the lifetime of the nth state. 
Theformula(1.17) that A 11) F„_ m is proportional to At is only valid for At 4 r„. 
It is, of course, possible to extend the above considerations for the “effect 
of the emission photons” to much longer times t, where we take into account 
the fact that several photons can be emitted in sequence; here, we would 
obtain more complicated expressions which cannot be expressed in closed 
form, since we already know that from the transition probabilities obtained 
from (1.17) a “master equation” of the form VIII (5.4) must be solved. For 
this reason we shall only consider the case in which At 4 We therefore 
obtain 

Z A (1) F„ m < E n 

m 

( £ m £ n) 


from which we obtain 


I A^ m <lE n <U 

n, m n 

(£m<£n) 

therefore we find that the effects A (1) F„^ m are coexistent. 

As we have previously mentioned, we have only introduced the “derivation” 
of (1.16), to obtain the expression (1.16). From here on we will consider (1.16) 
as an axiom for the selection of the operator A (2) F„^ m together with a corres¬ 
ponding (not given explicitly) mapping rule between the characteristics of 
the construction of a “spectral apparatus” b 0 with a registration b for which 
*F(h 0 , b) = AF„^ m where AF„ , m are obtained from A (2) F„_ m by integration 
over solid angle (as measured from the slit of the apparatus) and from other 
factors which are also determined by the apparatus. 
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The “spectral apparatus” b 0 therefore corresponds to an observable (in the 
sense of VI, §1) 


Wbo) 




L, 


where the atoms of the Boolean ring are mapped onto the AF„_ m . This 
observable is, however, not a decision observable! The frequency co may be 
used as a measurement scale in the sense of IV, §2.5. This scale has a discrete 
spectrum with discrete values co nm . Here the scale used has a physically well- 
defined sense in terms of the construction of the spectral apparatus. The 
observable measured by the spectral apparatus cannot be confused with the 
decision observable H (where H is the Hamiltonian operator) although the 
point spectrum of the frequencies measured by the spectral apparatus 
contains the eigenvalues of H as differences a> nm = e„ — e m ! 

The expression for magnetic dipole radiation is similar to that obtained 
above for electric dipole radiation; here we obtain 


£< 2 > = * x m. 


B (2) = 7^72 X I — X 

X 



the Poynting vector is given by 


S (2) = 


1 1 
47t 15c | 2 



x in 


x 

|xf 


Because of this similarity, we shall not list the corresponding formulas. 

As we have noted in VIII, §5, the effect of the emission of light is such that 
the operator for the time translation by y is not precisely given by e lHy . We 
must, in addition, describe the “slow” changes in W by an equation of the 
form VIII (5.4). 

According to (1.17) it is possible to choose, for A nm in VIII (5.4) 
(axiomatically!) 


l4co 3 

A nm = I - 'f-E™ f ° r 6m < £ n > 

A nm = 0 otherwise. 


(1.18) 


On the basis of equation VIII (5.4) the meaning of x„ introduced above 
becomes clearer, since VIII (5.4) describes how an “excited” state decays. 
If W y is the solution of VIII (5.4) then for the probability for the effect con¬ 
sisting of the emission of a photon of frequency co nm in the time interval 
t to t + At we would obtain 


(1.19) 
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2 Ensembles Consisting of Bound States 

In many cases it is relatively easy to “guess” the operator W which corres¬ 
ponds to an ensemble produced by a given preparation procedure from the 
structure of the preparation apparatus. We will discuss an example which will 
enable us to describe many experiments. Again the derivation will be of a 
heuristic nature. The result for W will be asserted as an axiom (together with 
a corresponding set of mapping rules for the preparation apparatuses where 
the latter are constructed according to definite principles) in close analogy 
to what we have earlier described for the case of effect (spectral) apparatuses. 

For the purpose of discussion we shall consider an apparatus which 
produces an extremely attenuated gas—a so-called ideal gas—composed of 
the desired material. Since, in an attenuated state it is permissible to neglect 
the mutual interaction of the individual atoms or molecules, we may con¬ 
sider the individual atoms or molecules to be prepared “in the state of an 
ideal gas.” What operator W are we to assign to the ensemble prepared by 
this preparation procedure? 

According to VIII (1.2) let (we shall neglect the index in Hi)H s + H denote 
the Hamiltonian for the atom or molecule where H s = P 2 /2M is the 
Hamiltonian operator of the center of mass and where H is the Hamiltonian 
operator of the inner structure, where earlier we have called the latter the 
rest-energy observable (see VIII, §1). Let 6 denote the absolute temperature 
of the gas. Then we set (where p = 1/0) 

W= CB s e~ fiH , (2.1) 

where B s is an operator which refers only to the center of mass coordinates, 
and C is a normalization constant determined by the equation tr(JF) = 1. 
(For the basis of this well-known “Boltzmann-ensemble” (2.1) see, for 
example [2], XV, §6.7.) 

The operator B s is, for the most part, not easily determined theoretically. 
The following assumption is often found to be useful for the description of an 
ensemble of an idealized gas: Let us consider the following vectors from the 
“Hilbert space of the center of mass” in the position representation (see the 
vectors of minimal APAq uncertainty in X, §1 and IX, §5): 

/ 2 \ 3/4 

I—j e -ip-<*-*') e -(|s-s'l 2 )/« 2 _ (2.2) 

We shall consider (2.2) to be a function of x in the position representation 
while p, x' are fitted quantities. For a we choose a quantity which corresponds 
to the so-called “mean free path” in the idealized gas present in the prepara¬ 
tion apparatus. The projection onto the vector (2.2) will be denoted by s .. 

We then set 

B s = je- ( » l2M) t 2 P Prx ,d 3 p d 3 x', (2.3) 
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where M is the total mass of the atom (or molecule) and x' is to be integrated 
over all space in which the ideal gas is present in the preparation apparatus. 

Earlier, in §1 we said that we would, for simplicity, neglect Doppler 
broadening of spectral lines. By this we mean that for W in (2.1), B s does not 
contain any “large” velocities which would result in a large Doppler broaden¬ 
ing relative to the resolution capability of the spectral apparatus. Naturally 
the considerations of §1 can be quantitatively improved with the help of (2.1), 
(2.3). We shall not, however, do so because we would only obtain a more 
refined mode of description, which will not, in principle, lead to any new 
viewpoints. 

The formula (2.1) is usable as an expression for the ensemble of the atoms 
(or molecules) prepared by the preparation procedure providing that 8 is 
sufficiently small and (consequently) /? is so large, that is, in 

e~ pB = £«"*»£,n + ' • ' 

n 

only the sum over discrete eigenvalues e„ plays a role. The E„ are, as in §1, the 
projections onto the eigenspaces of H. We therefore assume that the spectrum 
of H which describes the inner structure of the atom (molecule) consists of 
discrete (and bounded from below) eigenvalues e„, and a continuous spectrum. 
The values for the continuous spectrum do not lead to a convergent ex¬ 
pression for tr (W)\ if /i is so small that the continuous spectrum of H plays 
an important role in an ideal gas state, then it is necessary to take into account 
the fact that the gas is actually confined to a finite volume, and it is necessary 
to replace H by an H' which takes into account the effect of the walls of the 
enclosure. The continuous spectrum of H describes the so-called unbounded 
states, in which some of the components of the atoms (molecules) are scattered 
through the entire space. We then can no longer speak of atoms or molecules 
as the microsystems prepared by using an idealized gas. The discrete spectrum 
of H is usually referred to as the set of “bound states.” Therefore (2.1) is 
usable only if we may replace e~ m by means of a finite sum e~ fl ' n E n over 
the lowest eigenvalues e„ of H. If the distance from the smallest eigenvalue e 0 
to the next higher e 1 is sufficiently large and 8 is sufficiently small (that is, 
ft is sufficiently large), then we may replace e~ llH (because a numerical factor 
can be taken into C from (2.1)) simply by E 0 . In this way we can recognize 
the great role played by the ground state of an atom or molecule: An en¬ 
semble W a; E 0 is easy to produce. In the theory of chemical bonding we are 
primarily interested only in this E 0 and the energy eigenvalue e 0 , whose 
structure we shall investigate extensively in XV. 


3 The Spectrum of Hydrogen-like Atoms 

A hydrogen-like atom consists of a nucleus of charge Z and a single electron 
(see VIII, §§2 and 5). Here we shall only be concerned with the so-called inner 
degrees of freedom, that is, we shall ignore the motion of the center of mass. 
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We shall also ignore the spin of the electron. We will assume that the Hamil¬ 
tonian operator is given by: 



+ V(r), 


where 


V(f) = 


Ze 2 

r 


(3.1) 


In this expression m is the reduced mass, which, because of the relative masses 
of electron and nucleus, can be taken to be that of the electron; r is the distance 
from the nucleus to the electron; P is the operator of (relative) momentum, 
which is practically equal to the momentum of the electron. 

Since the operator (3.1) has spherical symmetry, we may apply the results 
concerning rotations and angular momentum obtained in VII, §3 and VIII, 
§1. Since H commutes with all rotations about the center of mass, the sub¬ 
space .T l m spanned by (p v (r)Y l m (6, <p) for fixed m and l is invariant under H, 
that is, H only acts upon the variable r in <p v (r). We may therefore choose the 
ip v (r) to be (proper or improper) eigenvectors of H. 

Since H transforms the different .T l m (l fixed;m = — 1,...,/- 1,0 

isomorphically, we may choose the same cp v (r) as the eigenvector of H for 
all of these values of m. We will characterize these cp v (r) by two indices n and l: 
cp„i(r ) instead of <p v (r). All the cp„, Y l m where m = —/,—/ + 1,...,/— 1, / 
belong to the same eigenvalue e nl of H. s nl is therefore degenerate of order 
(21 + 1). The eigenspace of H corresponding to the eigenvalue e„, is invariant 
under the action of the representation Q) l of the rotation group. The basis 
vectors cp nl Y l m of this eigenspace are eigenvectors of L 2 and L 3 with eigen¬ 
values 1(1 + 1) and m, respectively. 

The above assertions are direct consequences of general results; we will 
now confirm these results directly by computation. For this purpose we shall 
divide the momentum P into two components, one in the direction x and 
the other orthogonal to x. Only the component which is orthogonal to x 
contributes to the angular momentum L = x x P. It follows that 

L 2 = (x x P) ■ (x x P) = x ■ [P x (5c x F)] 

3 

= E (XtPkXtPk - x k P,x,P k ). 

i,k = l 

By taking into account the commutation relations and the relation 


we obtain 



P 2 = 



In spherical coordinates we obtain 


L 2 


1 d ( d 1 8 2 \ 

sin 9 30 \ m ® d0 + sin 2 6 d<p 2 / 


(3.2) 


(3.3) 
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Since 

we therefore obtain 

P 2 (PrnY l m = 


£ 2 Y i m = in + i)n. 

1 d (,.2 d( Pnl , 1(1 + 1) 
“r o 


r 2 dr \ dr 


(Pnl 


Y l 

1 m • 


Thus for (p nl we obtain the equation 

1(1 + 1) 


If 1 d ( 2 d(p nl \ 1(1 + 1) 1 T „ . 

s l r “sy + H + <34) 


2m 

For V(r ) = — Ze 2 /r (3.4) becomes 


„ 2 , / _b c '. 

+ r <Pni + I a + 2 - + )<p„; = 0, 

r \ r r ' 


(3.5) 


where 


a = 2me ni , h = mZe 2 , c = —1(1 + 1). 

We must distinguish between two cases: e nl < 0 and e nl > 0. In the first case 
we may, according to the dimension of a, set 

1 


= 


a 


(3.6) 


From (3.5) we obtain the asymptotic behavior for large values of r as follows: 


" 1 n 

(Pnl - -T (p„i~ 0, (p nl ~ e 


±r/r o 


Since cp nl is an element of Hilbert space, for large r we must have 

(Pnl ~ e~ rlro - 

For p = 2r/r 0 we make the substitution 

(Pnl = e~ pl2 u nl (p) 

and for u nl (p) we obtain 


(3.7) 

(3.8) 

(3.9) 


Kl + ~ 1 JU'nl + 

For a trial solution we set 


b _ x \l 1(1 + 1) 


u nl = 0. (3.10) 


Ki(p) = P X Z 

li^O 

where a 0 + 0. By substituting the trial solution in (3.10), and equating powers 
of x we obtain 


X(X - 1) + 2X - 1(1 + 1) = X(X + 1) - 1(1 + 1) = 0 

from which we conclude that either X = lor X = — l — 1. The second solution 
of (3.9) has a singularity at r = 0 and does not belong to the domain of 
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definition of H (VIII, §5). Therefore, only the first solution A = / is usable. We 
therefore set 


00 


Unl = P l Z a nP fl - 

H = 0 


(3.11) 


By substituting (3.11) into (3.10) we obtain the following recursion relation 
for the a^: 


t(ji + / + l)(/t + 0 + 2(/r + l + 1) 

b 


Kl + 1)K +1 


— I /i -f / 4- 1 — 


(3.12) 


If the series ^ ff does not have a finite number of terms then, for large r, 
u nl will behave like e +p because, for large p 


a 





Vnl ~ U nl e 


-Pl 2 


behaves like (for large p) e +pl2 and therefore cannot be an element of Hilbert 
space. Therefore, the series must terminate; this may be the case for p = n v . 
Thus a„ v+ j = 0, and, according to (3.12) we must have 



n v + l + 1 = n 


(3.13) 


from which it follows that 


me 4 Z 2 

~2rS~ 


(3.14) 


From (3.13) we obtain the following condition for n: 

n > l + 1. (3.15) 

In the domain e < 0, H only has discrete eigenvalues. The eigenvalues s„ 
are graphically represented in Figure 2. They determine the spectrum of the 
hydrogen atom by the equation 



with the Rydberg-constant 



In the “term-scheme” Figure 2 the perpendicular lines identify the transitions 
which lead to spectral lines co nm . For m = 1 and n = 2, 3,... we obtain a 
series of spectral lines, the Lyman series, which lies in the ultraviolet. For 
m = 2 and n = 3,4,... we obtain the Balmer series, for m = 3, n = 4, 5,... the 
Paschen series, etc. The entire spectrum for hydrogen and its decomposition 
into series is represented in Figure 3. 
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Figure 2 Term scheme of hydrogen. 




3 The Spectrum of Hydrogen-like Atoms 77 


A.A.E. 

912 


1000 


1200 


1500 


2000 


3000 


WOO 

5000 

6000 \ 

8000 

10000 

20000 


US 

Hy 

"/> 

Ha 


I 

o 

ai 

to 

■4 

o 


972.50 

1025.83 


1215.68 


3970.075 

0101.738 

0300.086 

0'861.327 


6562.793 

95082 


10938 

12818.1 

18751.1 

2.63/L 

0.05/1 


<s,' 


ai 

to 

c 

<0 

I 


372.5V 

7025.83 


1215.68 


is' 

ii 

Ri 


XLr 4 

II 


Co 


CD 

<0 

no 


1 He. 
HO 

r/o 


Ha 


3970.075 
V701738 
V3V0.V66 
V861.327 


6562793 


to 

*cP 

is' 


|«V3 


II 


Co 

c: 

QJ 

< 

CO 

! 

I s1 


, 95V6.2 
k 700V9.8 
10938 
72818.17 
78757.05 


Os 

to' 

ii 


II 

CD 

CO 

05 

•g 

2 

on 


2.53/1 

0.05/1 


cnv ? 

109678 


700000 


90000 


80000 


70000 


60000 


50000 


70000 


30000 


20000 


10000 


Figure 3 Spectrum for hydrogen. 
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The eigenvalues e„ of H are multiply degenerate. To each jtair of values 
n, l there are 21 + 1 independent eigenvectors of H and of L. In addition, 
for fixed n the “angular momentum quantum number” / can take on values 
0,1— 1. Therefore the degree of degeneracy of e„ is equal to 


n- 1 


E (2/ + 1) = 

1=0 


n 


2 


From symmetry arguments, that is, since H commutes with L 3 and L each 
eigenvalue e nl must be (21 + l)-fold degenerate. For the case in which 
V(r) = —Ze 2 /r the eigenvalues e nl for / = 0, 1,..., n — 1 “accidentally” 
coincide. If V(r) is not the Coulomb potential, then we would expect a greater 
manifold e„, of energy eigenvalues. 

From experience with hydrogen, and the corresponding ions of the 
elements helium, lithium, etc. the Balmer terms (3.14) have been exceptionally 
well confirmed. Only under extremely high resolution is it possible to show 
that each term is not a single term but consists of multiple adjacent terms. 
We shall return to this topic later in the book. 


4 The Eigenfunctions for the Discrete Spectrum 

If we write u nl = p l w nl (p) then w nl will be a polynomial of n v th degree 
(n v = n — l — 1). From the recursion formula (3.12) it follows that 

w n ,(p) = c nl F(-n + l + 1, 21 + 2, p), (4.1) 

where c nl is a constant factor and F( a, /?, p) is the confluent hypergeometric 
function defined by 


. , ot x a(a + 1) x 2 

F(xJ,x)-\+-- + WTT) - + 


(4.2) 


The w nl (p) may also be written in terms of Laguerre polynomials (see X, §1). 
The latter are defined by 


F1(x) = ~-L k (x)\ 


L x(x) = (! x (e X x'). 


(4.3) 


We obtain 




v = 0 


(4.4) 
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and 

L"(x) = (-lfAlQfCA - p, n + 1, x). (4.5) 

Thus we obtain 

( _l) 2i+1 

W ni(p) = c nl - , V Lift '(p) = D nl L 2l + + , \p). (4.6) 


The factor c„ ; or D nl is to be chosen such that the entire eigenfunction 

D nl p l L 2 n l + Y(p)e-* 2 YUe , cp) 


is normalized. Thus we obtain 


D 2 n,{jf Jp 2<i+I) e-[L„ 2 'V(P) 


dp = 1, 


where r 0 is given by (3.6) and (3.13) since the Y‘ m are normalized on the surface 
of the sphere. We obtain (see [9], Band II, p. 84): 


, (2Zme 2 \ 3 (n — / — 1)! n 

D 2 = I -) ^and r 0 = 


n I 2 n[(n + /)!]" 


Zme 


Clearly 1/me 2 is the approximate size of the hydrogen atom (Z = 1) in the 
ground state (n = 1). 

Finally, in terms of the notation introduced in IX we obtain 


<x 1; x 2 , x 3 |e„, /, m> 


(2 Zme 2 \ 312 j(n — l — 1)! 

\ n j 2n[(n + /)Q 3 

( 2 -) l L 2l + V (2-)e~ r "°Y' m (6,(p), 
\ r o) \ r 0 J 


(4.7) 


where r, 9, (p and x,, x 2 , x 3 are related by the equations Xj = r sin 0 cos cp, 
x 2 = r sin 6 sin cp,x 3 = r cos 9. The quantities <x 1 ,x 2 ,x 3 |e„, /,m) are also 
the coefficients of the position representation in the ( H , L 2 , L^-representa¬ 
tion. The set of these coefficients is not complete, because all values e > 0 
belong to the continuous spectrum of H, as we shall see in the next section. 

In Figures 4 and 5 we exhibit the behavior of the radial functions cp nl (r). 
In Figure 4 the function itself is given. In Figure 5 the charge density within 
a spherical shell of radius r to r + dr is given. 

A very intuitive picture of the probability density |<x 1; x 2 , x 3 |e„, l, m}\ 2 
is given by the White models (see Figure 6). 
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Figure 6 Probability density |<x, x 2 x 3 |s n l m}\ 2 


5 The Continuous Spectrum 

In §3 we have assumed that e < 0 and therefore a < 0, so that we set (accord¬ 
ing to (3 6)) fo = — 1 /a We will now consider the case in which a > 0 For 
this case we set rl = 1/a The asymptotic behavior of <p nl is given by 
cp nl ~ e +l(rlro) instead of by (3 7), from which we conclude that for all values 
e > 0 the eigenfunctions are improper 
The computations of §3 can be carried out if we set p = 2i(r/r 0 ) For 
( p„i — e p 2 u nl (p) we again obtain equation (3 10) The power series expansion 
(3 11) is also valid, the only difference is that the power series does not ter¬ 
minate We shall now replace e by the quantity k where k is defined by the 
equation k 2 /2m = e, k is the wave number of a free electron with energy e 
We then may write 


n = 



Zme 2 
ik 


( 51 ) 


where n is, of course, no longer an integer In order to obtain a closed 
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analytic expression for the above power series we shall begin with the 
formula (4.3) 

L M = eP £- n (p" e ~ p ) 

for the Laguerre polynomial. 

If/( p ) is an analytic function, then the following integral expression holds 
in general 

— ~ f(p) = 1 ~~tt dz, 

n! dp J 2niJ(z - p)" +1 

where the integral is taken over a closed path which contains the point z — p. 
Thus it follows that: 

L n (p) = ^e p j>z"e-\z - p)“- 1 dz. (5.2) 

This expression is, however, a solution of the differential equation 

pL" + (1 - p)L' + nL = 0 (5.3) 

even in the case in which n is not a positive integer. Here we only assume that 
the integration path is closed. The fact that equation (5.2) is a solution of the 
differential equation (5.3) can be verified by substitution and integration by 
parts. The path is closed for arbitrary n (not necessarily only integer or real 
values of n) if we choose an integration path such as that illustrated in Figure 
7 in which the changes of the integrand which result from the branch points 
at z = 0 and z = p are compensated. 

According to (3.9) we can write the eigenfunctions as follows 
< Pni(r)Y l m (6, <p) where 

(pni = D'e~ p,2 (ip) l L 2 ‘ +7 (p), (5.4) 

that is, 

<p nl = De~ pl2 ( — ip) 1 j)(z + p) n ~ l ~ 1 e~ z z~"~ l ~ 1 dz, (5.5) 




Figure 8 
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Figure 9 


where p and n are purely imaginary, and the integration path [obtained by 
substituting z -*■ z + p] is given in Figure 8. The integral can easily be 
expanded into a power series, since the integration path can be enlarged such 
that the expansion of (z + pf ' 1 ~ 1 in terms of p lies in the convergence 
domain. In this way we obtain the power series expansion (5.2) which was 
introduced earlier. 

We are particularly interested in the bahvior of <p„, for large values of r. 
For this purpose, let us make the substitution z = p(u —j) in (5.5). We obtain 

Vnt = D (ip)~ 11 +i)"~ !_1 (« - i)~ n ~ l ~ 1 e~ pu du, 


where the integration path is the closed path in Figure 9 which can be 
replaced by both loops in Figure 9. We may write q> nl = Ylni + ri nl ) where y 
and fj are the integrals over each of the loops. If we expand the integrands in 
powers of 1 jp, then the asymptotic behavior for large r is given by 


Ini ~ 


e Pl2 p" 1 


T(n + l+ 1)’ 


Vni ~ e pl2 (~P ) " 1 


g-ra(n + i/ 2) 

r (-n + 1+1) 


and for cp nl we obtain 


(Pnl ~ 


sin(/cr + y) 
C kr 


(5.6) 


where 


g-(n/2)|«| 

| T(n + l + 1)| ; 


y = | n | log 2 kr + a — — /. 


Here a is determined by F(n + l + 1) = |F(n + l + l)\e m . Thus, up to a 
normalization factor N, for the continuous spectrum <5c|e, l, m) is given by 

<x|e, l, m) = Ncp nl (r) Y l m (9, (p), 


where n is the value given by (5.1) and cp nl is given by (5.4). 
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6 Perturbation Theory 

For spherically symmetric potentials other than V(r) = Ze 2 /r it is, for the 
most part, not possible to obtain an exact calculation for the spectrum of H. 
Often, for problems which cannot be solved exactly, it is possible to obtain a 
satisfactory “approximate solution” by using the “method of perturbations.” 
We will now describe a frequently used general perturbation procedure (in 
§§6 and 7 we will discuss this problem in a more general way than is required 
for the case of one-electron systems). The method consists of replacing the 
Hamiltonian operator H by an approximate operator H' which we assume 
(in the sense of the discussion in [8], §8) is less precise than H, but describes 
experience with a certain more or less well-known accuracy. The investiga¬ 
tion of perturbation theory presented in [10] is nothing other than a mathe¬ 
matical attempt to determine the corresponding “imprecision sets” (see [11] 
and [8], §§5 and 6), and therefore to solve the problem outlined in [8], §8 
concerning the relationships between two theories, one more precise than 
the other. Here we shall only outline a procedure by which we may obtain 
an approximate H' from an exact H (which permits us to mathematically 
treat the problem) in the hope that this H' will be useful (that is, the corres¬ 
ponding theory with a given imprecision set in the sense of [8], §§5 and 6 and 
[11] will be useful). 

This procedure is tailored to the discrete spectrum for H. It can be shown 
that in the physics of atoms and molecules the Hamiltonian operators can 
only have discrete eigenvalues which are at most finitely degenerate. This 
fact will be used in the following procedures. 

First Step. We seek to decompose the Hamiltonian operator H into two 
parts as follows: H = H 0 + H u such that H 0 contains as much as possible 
of H, and for which the eigenvalue problem of H 0 can be solved. We may, for 
example, instead of including V(r) in H, seek to obtain an approximate 
potential V 0 (r) such that V 0 (r ) contains the “essential” part of F(r), but for 
which the eigenvalue problem of H 0 (which contains F 0 (r)) can be solved. 
H 1 then has the form H 1 = V(r) — V 0 (r). We shall call H 0 the zeroth 
approximation and H 1 the perturbation. 

The discrete part of H 0 may be written in the form £„4 0> £ ( „ 0> . According 
to the previous remarks, each of the projections £' 0) is finite dimensional. 

Second Step. We consider the discrete spectrum e ( „ 0> and determine whether 
several eigenvalues are “densely spaced” in groups. Two 4°* and 4°* are to 
be considered “densely spaced” if the perturbation H 1 displaces the eigen¬ 
values more strongly or about a magnitude | ej^ - 4°' |. If all the eigenvalues 
are densely spaced in this sense, then we can select arbitrary intervals from the 
energy scale and collect all 4 0) which lie in the interval into a group; we must 
then take into account that the computation of the displacement of the 
eigenvalue 4 0) under the perturbation is incorrect in the neighborhood of the 
interval boundaries. This can be managed to some extent by the use of over¬ 
lapping intervals. We will not consider these more complicated cases. 
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We therefore assume that the 4 0> (or at least those which are of interest with 
respect to the perturbation) are either widely separated or are divided into 
groups (e^v) (where v is the index of the group) which are widely separated 
and contain only a finite number of indices n ; in a given group. For 
£ v = £„. (where the sum is taken over the elements n f of the vth group) 
we construct the operator 

H (l) = £ e ( „ 0) £l 0) + £ E v h,e v = £ E v he v . (6.1) 

MV V 

Here we call H (i) the operator of the first approximation, H = H a) + H 2 
where we call the remainder H 2 the perturbation of second order. 

The solution of the eigenvalue problem for H (1) consists of the clearly 
finite-dimensional eigenvalue problems for the various H v = E V HE v . For 
the special case in which £ v is identical to E ( „ 0) (that is, 4 0) is not a neighboring 
eigenvalue to other eigenvalues of H 0 ), then 

H a) = E ( ° ) HE ( ° ) = £< 0) E ( „ 0) + E ( „ 0) i/ 1 E ( „ 0) . 

Since, in the finite-dimensional subspace determined by E ( „ 0> the operator 
4°>E<°> is a multiple of the unit matrix, we need only determine the eigenvalues 
of E ( „ 0> H 1 E ( „ 0> . The eigenvalue equation for E V HE V , or for E^Hi £ ( „ 0i , is often 
called the secular equation. If we denote the eigenvalues of E^H^E^ by 
44 then, as a first approximation, that is, for the eigenspace of H w we obtain 
the values 4 0> + the eigenvalue for the zeroth approximation 4 0) is, for 
different a, split into the eigenvalues e*, 0) + 41* of the first approximation. If 
E ( „ 0> is one-dimensional, then we naturally do not find any splitting, and we 
obtain 4° = <4 0) |#i |4„ 0> ), that is, the expectation value of Hi in the 
eigenstate of H 0 for the (nondegenerate) eigenvalue e ( „ 0> . 

We have characterized these two important steps of the zeroth and first 
approximations without making any assumption concerning the series 
expansion in order not to unnecessarily burden the approximation procedure 
with the problem of the expansion of the eigenvalues and eigenvectors in 
power series of a parameter. For such questions and for questions concerning 
the validity of such approximations we refer the reader to [10]. Here we 
shall instead seek to formulate an additional approximation step. 

We shall begin by considering the decomposition H = H a) + H 2 . This 
decomposition has the following properties: Let Ej/* denote the projections 
onto the eigenspace of H (l) with the corresponding eigenvalues ej/’ (not to be 
confused with the above 44-' on the contrary, in the above case it is necessary 
to set fit” = 4 0> + 44). Then from the definition of H and from the condition 
that no two eigenvalues ejj, 1 ' f rom different groups v may coincide it follows 
that all expressions of the form E { ^ > H 2 E' ll n must be equal to 0 (otherwise, the 
groups were incorrectly chosen). Since all the E { I } ) H 2 E > ^ ) = 0, we must seek 
the effect of H 2 in another way than that described in the “second step” above. 

In order to make an additional step (a third step with respect to the previous 
two) we must transform the eigenvectors of H (1) by a unitary transformation 
at least approximately) into the eigenvectors of H (1) + H 2 - 
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We therefore seek a unitary transformation U such that for 


E; = U + E ( I } ) U (6.2) 

the nondiagonal terms + H 2 )E' P for fi # p are as small as possible. 

In this way the spectrum of H (1) + H 2 can be approximated by the solution 
of the finite-dimensional eigenvalue problem E' fi (H (1) + H 2 )E' p . 

From (6.2) it follows that 


£;(H (1 > + H 2 )E' p = U + E^U(H W + H 2 )U + E^U. 

In order to establish a reference point to compare the magnitudes of different 
terms we shall apply a factor A to H 2 and set U = e ikS . In order to obtain 
minimal nondiagonal elements we shall choose S such that the expansion of 

e a \H (l) + XH ( 2 ] )e~ iXS 


has no terms of first order in A. This will be the case if 


H 2 = i[H (1) , SI 

(6.3) 

This condition is satisfied for 


c _ v E["H 2 El 1] 

b i(z[ r> - 4 1 *)' 

(6.4) 


V,/X 


From which we obtain 

e as (H w + XH 2 )e~ as = tf (1) + y i[S, tf 2 ] + • • •. 

For the approximation operator for the third step we select 

H' = £ e iS E^ ) ^H a) + l - [X, H^E^V 5 . (6.5) 

In order to find the eigenvalues of H' in (6.5) it suffices to solve the finite¬ 
dimensional eigenvalue problems of 

+ ( 6 . 6 ) 

because the multiplication by e -,s on the left and by e iS on the right does not 
affect the eigenvalues, but only the eigenvectors. The eigenvectors of (6.6) 
need only be multiplied by e iS in order to obtain the eigenvectors of (6.5). 

The eigenvalue problem (6.6) again has the form of a secular equation. 
Using (6.4) the operator (6.6) is given by 


F (1) H F a) H E a) 

p(l)pd) _L_ V n 2 n p 

> ^ L „(1) _ p (l) 

p(*p) C P C P 


(6.7a) 
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If the eigenvalue of H {1) is nondegenerate, that is, if £*/> is one-dimensional, 

then from (6.7a) we obtain the eigenvalue 


e 


(i) 

n 


+ 1 

P(*P) 


&»\H 2 E<»H 2 \%'> 

pd) _ pd) 


(6.7b) 


In (6.7a) or (6.7b) the sum over p can be extended as an integral over the con¬ 
tinuous spectrum of H (1) as follows: Let E <1> (e) be the spectral family of H (1) 
(thus Ej/’ will be the discontinuities of E U) (e) at locations ej/’). Then, in 
general, we may write: 


s^E^ + lim 

< 5->0 



E^H 2 dE <l) (i;)H 2 E\l ] . 


(6.7c) 


7 Perturbation Computations and Symmetry 

We will now continue our discussion of the problem described in § 6 . We 
shall assume a decomposition of the Hamiltonian H of the form H = H 0 + H 2 
as in §6 where we assume that the spectrum of H 0 is known. Instead of 
beginning with the perturbation computation described in § 6 , we shall 
consider an operator H(X) = H 0 + AH, which depends on A (in the interval 
0 < A < 1 ) and seek to determine what we can about the A dependence of 
eigenvalues and eigenspaces for the operator H(A), in order to obtain infor¬ 
mation concerning the spectrum of H( 1) = H 0 4- H l . 

Later we shall encounter Hamiltonian operators which depend on one or 
more different parameters (see, for example, the discussion of molecular 
spectra in XV, §§3 and 5). In our discussion of this problem we will permit the 
parameter A to take on values which are points in a complete metric space A 
in order to obtain general statements concerning the dependence of the 
spectrum of the Hamiltonian operator H(A) upon A. 

We will now assume that the lower bound of the continuous spectrum of 
H(A) is a continuous function 77 (A); often 77 (A) will be constant. We are only 
interested in the discrete portion of the spectrum where the latter is below 
77 (A). If, for a particular value of A there are infinitely many discrete eigen¬ 
values of //(A) below ? 7 (A), then let 77 (A) be the one and only one accumulation 
point of these discrete eigenvalues. 

Furthermore, it may be possible to number the eigenvalues of this spectrum 
below the continuum in such a way that we obtain continuous functions 
e„(A) for which not all the e„(A) need be defined for all values of A, namely when 
the discrete eigenvalue s„(A) exists only for an open subset A„ of A and on 
the boundary of A„ approach the value of the continuum limit. We then say 
that the eigenvalue e„(A) emerges from the continuum in the transition from 
points outside A„ to points within A„. 
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The mathematical problem, to find conditions for H(X) under which the 
above assumptions and other assumptions to be made later are satisfied is 
investigated in [10]. 

Some of the eigenvalues e/A) can, for definite points A, coincide. We will 
therefore not assume that the different functions e/A) are different for all 
values of A. On the contrary, it is precisely those points A and their immediate 
neighborhoods where several e„(A) are equal which are of special interest. 

For each point A the projection operators E/A) onto the eigenspaces 
corresponding to the dififerent(!) eigenvalues e are defined where e may only 
run through the different values of the e„(A) (where A is fixed). Therefore, in 
general, it is not possible to define E , nU) ! Therefore we cannot make any 
assumptions concerning the continuity of the A dependence of E/A). A 
meaningful continuity assumption for the E/A) can only be formulated as 
follows (here we assume that all the E/A) are finite dimensional): 

For fixed A and e we define the subset JV(A, e) of all n for which i:„(X) = e. 
For each point A' we set o(A', A) = {e'| e'= e/A') and n e /V(A, s)} and 
E/A', A) = Ej/A'). We therefore obtain E/A, A) = E/A). We say 

that E/A) is continuous at the point A if there exists a neighborhood Y of 
A for which E/A', A), as a function of A' (that is, for A' e Y) has the same 
dimension as E/A) and E/A', A) (as an element of ^') is a continuous at the 
location A' = A. 

We now assume that E/A) is, in this sense, continuous in all of A. The above 
neighborhood Y need then only be chosen so small that in Y the e/A') with 
n e iV(A, s) does not coincide with another e„/A) for n ' A/A, s). 

Since the functions s„( A) are continuous, there exists a neighborhood Y of A, 
for which the number of different values taken on by e/A') for A' ^ A, A' e Y 
and for the n e A (A, e) is constant (A(A, e) is finite, because the e/A) have at 
most one >/A) as an accumulation point!). We say that the eigenvalue e splits 
in the neighborhood of A into the eigenvalues e'(A') where e'(A') runs through 
the different values of e/A') for n e A(A, e). We may make statements con¬ 
cerning this description of splitting if H(X) commutes with the unitary 
operators of a group representation for all A. 

If U is such a representation operator, then the commutivity of H( A) 
with U (see AIV, §8) means that all E/A) commute with U, that is, the finite¬ 
dimensional subspace i/A) of the Hilbert space determined by E/A) is 
left invariant by U and is therefore a representation space for the group. Here 
we shall only consider compact or finite groups. We will show that the 
reduction of the representation into irreducible parts in i/A) has implications 
for the possibilities for the splitting of the eigenvalues e in the neighborhood 
of A. 

As we assumed above, there exists a neighborhood Y of A for which 
E/A', A) has the same dimension as E/A) and E/A', A) —<• £(A) in the rx-topology 
if A' -> A. We will now show that E/A', A) converges to E/A) even in the trace- 
norm topology. 

Let the dimension of E/A) be m and let <p v (v = 1,..., m) be a complete 
orthonormal basis for the corresponding subspace i/A). Because of the weak 
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continuity we obtain (cp v , £/A', A)<p v ) = ||E C (A', A)<p v || -> <<p v , £ £ (A)<p v > = 1 
and therefore E t (X', X)(p v -> <p„. Thus it follows that there exists a neighbor¬ 
hood of A such that we may choose a complete system of orthonormal vectors 
i// v (A') (v = 1,..., m; £ e (A', A) has the same dimension as E,.(A)) for £ e (A', A)Jf 
such that || i/i v (A') — <p v || < 5 for all v. From this result we again find that in 
the trace norm ||£ e (A', A) — £ e (A)|| s -> 0 for A' -> A. 

Since, for each unitary operator 

l|tr(£ f: (A', A)C7) - tr(£/A)(/)|| < ||E/A', A) - £/A)|| s 

the character xAd) of the representation of a group into the subspace n e (A', A) 
corresponding to £/A', A) is continuously mapped into the character y k (g) 
of the representation in 4 e (A), from which we obtain the following estimate 
(which is independent of the group element g ) 

I iM - Xx(g)\ ^ W, A) - £ e (A)|| s . 

Since the number n„(A') with which the gth irreducible representation 
occurring in 4 E (A', A) can be calculated either by means of an integral of 
y_Ag) over the compact group space (finite volume!) or by means of a finite 
sum over the group elements (AV, §§8 and 10.5) n^(A') is continuously mapped 
into n/A), that is, n^(A') is constant in the neighborhood of A. 

The splitting of the eigenvalue e into eigenvalues s' in the neighborhood 
of A can only occur in such a manner that the sum of the multiplicities of the 
eigenvalues s' remains constant. Therefore s can be split into as many s' as 
there exist irreducible representations (counted according to their multi¬ 
plicities) in the eigenspace corresponding to £. 

Of particular interest is the case in which H(A 0 ), for a particular value 
A 0 of A, commutes with the representation operators of a group r S but, for 
A # A 0 only commutes with the representation operators of a subgroup # of 
( S. The irreducible representation of in ^.(A 0 ) splits into several irreducible 
representations of #. This splitting of an irreducible representation of ( S into 
several irreducible representations of the subgroup # determines by pure 
group-theoretical methods the splitting of an eigenvalue e in the neighbor¬ 
hood of A 0 . 

The previous result is one of the ihost important tools for an understanding 
which permits us to make at least qualitative statements concerning the 
spectrum of a Hamiltonian operator when a quantitative computation 
appears to be impossible. Here the consequences of representations in the 
eigenspaces becomes more important because the latter determines the 
structure of the operators for the effect of the emission of a photon as de¬ 
scribed by (1.16) and (1.17). We will consider this topic in more detail in 
Chapter XIII. 

We shall now extend our discussion of the splitting possibilities for an 
eigenvalue by considering the representation of a group in the case of the 
perturbation calculation described in § 6 . 

The decisive step in the perturbation computation is the secular equation 
E V HE V in (6.1) or £^(H (1) + (i/2)[S, H 2 ])E£> in ( 6 . 6 ) or (6.7a). 
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If H 0 and H l commute with the representation operators of the group, 
then the following operators H, E v , E { p \ S, H 2 also commute with the 
representation operators. The secular equation than has the following form: 
In a finite-dimensional subspace t of a Hilbert space (the subspace t corre¬ 
sponding to E v or to Eft ’) we are given a unitary representation of a group and 
an operator H" (= E v HE V or E^\H W + (i/2)[S, H 2 J)E^ ) ) which commutes 
with the representation operator. We seek the eigenvalues for the operator H". 

In AIV, §14 we have solved the problem of obtaining the most general form 
of an operator El" which commutes with a representation. If M is the set of 
the representation operators for the group then from AIV (14.9) it follows 
that for H" 

H- = £ 2,(1 X i/<“>)2*, 

a 

where the space t can, according to AIV (14.7) be written in the form 

« = £ © x jf ( 2 a) 

a 

and, according to AIV (14.8) the representation operators of the group have 
the form 

A = £ Q a (A (a) x 1)2* 

a 

The secular equation problem for H" is then reduced to the individual 
secular equation problems for the individual H (x> in ( 2 \ 

The dimension of Jf ( 2 a) is, according to AIV, §14, equal to the multiplicity 
with which the same irreducible representation (characterized by the index a) 
occurs in t. If an irreducible representation occurs only once, then { 2 > is 
one-dimensional and H {x) is the operator, the effect of which is multiplication 
by an eigenvalue of H", that is, the solution of the secular equation is trivial. 
For such an a. the x vp in AIV (14.5) (where p takes on only a single index value) 
are all eigenvectors of H" corresponding to the same eigenvalue. The x vp 
obtained from group-theoretical considerations are often said to be the 
“correct” linear combinations. 

The procedure described here to simplify the secular problem is applicable 
to innumerable examples. In order to illustrate this procedure we shall give 
examples; in the following section we will consider the case of the alkali 
spectrum. 


8 The Spectrum of Alkali Atoms 

The spectrum of alkali atoms is very similar to the spectrum of hydrogen 
atoms. In XVII, §3 we will find that the atoms of the alkali metals series of 
the periodic table (lithium, sodium, potassium, rubidium, cesium) can be 
treated as one-electron systems, where the electron (the “optical” electron) is 
considered to move in the Coulomb field of the nucleus and the average field 
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of the remaining electrons. The potential V(r ) for the optical electron has, 
for large values of r, the form — e 2 /r corresponding to the fact that the nuclear 
charge Ze is shielded by the remaining Z — 1 electrons. For very small values 
of r the charge of the nucleus will be completely effective, and we must have 
V(r ) ~ — Ze 2 /r. We may therefore write the potential as follows: 

V(r) = - - + f», (8.1) 

r 


where V x (r) is negative, and nonzero only in the vicinity of the nucleus. Let 
H 0 denote the Hamiltonian of the hydrogen atom and let H x = V 1 ; from 
the discussion of the perturbation calculation in the previous section we 
obtain the following result: The eigenspace 2T n corresponding to the energy 
eigenvalue e„ = — R/n 2 of H 0 may be decomposed by irreducible representa¬ 
tions of the rotation group into 

= "l © (8.2) 

1 = 0 

where t l is the subspace generated by the representation <2> l of the rotation 
group which corresponds to the angular momentum quantum number l. In 
general the eigenvalue e„ is split into n eigenvalues e nl (l = 0, 1 ,..., n — 1 ). 

In the first approximation the “correct” linear combinations in the 
position representation are given by the eigenfunctions in (4.7). The eigen¬ 
values in the first approximation are therefore given by: 

£«; = — + < 8 -, l>m\V t |e„, /, m>, (8.3) 


where <e„, /, m\ V x |e„, /, m> cannot depend on m, and is given by 


<£„,/, mlFJe^/, m) 


(2Zme 2 \ 3 (n — l — 1 )! 

y n J 2 n[(n + /)!] 3 



(8.4) 

e -2(rlr 0 ) r 2 


Since, for fixed n the distribution 



g —2 (!■/>• o)j.2 


is, for increasing /, displaced towards larger values of r (see Figure 5) the term 
e „ 0 is the lowest, and, for increasing /, the values of e nl approach —R/n 2 . 
This is precisely the behavior of the alkali metal spectra. In Figure 10 the 
spectrum of the lithium atom is compared to that of the hydrogen atom. The 
term n — 1 does not exist for lithium; we shall explain this fact in XIV, §3. 
For the other terms n = 2, 3,... we observe the behavior described above. 

In Figure 10 we give the conventional term notation: The terms 
e„o ,£„!,£„ 2 , £„3 are called S,P,D, F, respectively; the remaining terms proceed 
in alphabetical order. 
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9 Electron Spin 

Precise observations of the spectrum of hydrogen atoms and alkali metal 
atoms demonstrate that the terms e„, which we have computed are actually 
split into additional terms. For the alkali metals this splitting is particularly 
transparent: The terms for which / = 0 remain simple. The terms for which 
l 0 consist of pairs of closely spaced terms. 

The Stern-Gerlach experiment, which is described in considerable detail 
in [6], XI, §7.2, demonstrates that the ground state e 10 of hydrogen and the 
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e 20 of the lithium atom are split into two components. This means that we 
cannot consider the electron to be an elementary object of spin 0. The Stern- 
Gerlach experiment and the splitting of the terms e„, for / # 0 suggests that 
the electron should be described as a spin % particle, as we have already 
described in VIII, §2. 

The Hilbert space Jf for an electron can be described (VII, §2) as the 
product space of a position or momentum Hilbert space with a two- 
dimensional spin space 4 S : Jf = ^ x This is done in the same way in the 
case of the relative motion of the electron about the atomic nucleus (VIII, §2). 

Our previous assumption for the Hamiltonian operator (3.1) can be 
considered to be an approximation H 0 where 

H 0 = H 0b x 1, (9.1) 

where H ob is the operator given by (3.1) which is defined in . H 0 is therefore 
defined in M and has the same spectrum as does H 0b . The degeneracy must, 
however, be doubled, since every eigenspace of H ob multiplied by the two- 
dimensional i s yields the corresponding eigenspace of H 0 . 

The “improved” Hamiltonian operator described in VIII, §5 may be 
written in the form 

H = H 0 + H v (9.2) 

10 Addition of Angular Momentum 

According to VII, §3 in ^ x i s the rotation group (as a subgroup of 
the Galileo group) about the origin (for the relative coordinate system used 
here, about the nucleus) may be represented by unitary operators of the form 
U D = V D x R d ( D is an element of the rotation group) where R D produces 
the representation <^ 1/2 in t s . We call the representation in the product 
representation of the two representations in ^ and i s . The Hamiltonian 
operator H given by (9.1), (9.2) will therefore not commute with V D x 1 as 
was the case with H 0 = H 0b x 1. In the absence of external fields we may, 
according to VIII, §1 expect that H is spherically symmetric, that is 

U d H = HU d . (10.1) 

Thus, for the angular momentum operators given in VII (5.3), it follows that 

J V H = HJ V , where J, = L, x 1 + 1 x S v . (10.2) 

From [J V ,H] = 0 it follows that angular momentum is conserved (see 

X, §1), that is, J v = 0. Since L,xl and 1 x S v do not commute with H there 
is no exact conservation law for these observables. L,x 1 and 1 x S v remain 
constant with time only in the approximation H = H 0 since 

[L v x 1, Ho] = 0; [1 x S v , H 0 ] = 0. (10.3) 

We note that in VII, §§2 and 7 that we have already called L, x 1 the 

orbital angular momentum and 1 x S, the spin angular momentum, 
respectively. 
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The problem of determining the eigenvalues and eigenfunctions for the 
angular momentum operator J v is equivalent to the problem of reducing the 
product representation given by V D in 3fC = ^ x i s into irreducible com¬ 
ponents. We will generalize this problem somewhat by seeking the reduction 
of x 3T 2 in the case in which 9~ x and are a pair of arbitrary representa¬ 
tion spaces. If = Yjk © h and = Yji 0 <j, where t k and o t are irreducible 
then it easily follows that x ^ k © (t k x a t ) where t k x o t are 
invariant subspaces which are not necessarily irreducible. In this way our 
problem is reduced to the problem of the reduction of -Tj x -T r where 
-T y belong to the irreducible representations <2>j and 3> r . 

It is easy to show that the character of the product representation in 
x ST 2 is equal to the product of the characters of the individual representa¬ 
tions in and -T 2 . Let cp = 2a; then, according to VII (3.39) (if we write 
x(cp) instead of /(D)) we obtain 

XM = t e im * (10.4) 

m= -j 

and for the character x(cp) for the representation x 3> y we obtain 
X(cp) = £ e im(p £ = £ e i(m+m> 

m=—j m' = — j' m, m' 

= I I e ik *. (10.5) 

J=\j~j'\ k=~J 

Therefore we obtain 2>j x So r = 3> j+r + S> j+ y- 1 + • • • + 

For the case in which the angular momentum of the system described by 
.Tj is equal to j, and that of the second system described by ,T r equal to /, 
then the total angular momentum J can assume the values 

j +j',j +f — 1, •••, \j —/I- 


For the case — 3^ b x t s we have to reduce the spaces ST l x that is, we 
need to add the orbital angular momentum / to the spin angular momentum 
(s = £) to obtain a total angular momentum of either j = l + \ or j = l — \ 
if / # 0; for l = 0 only; = j is possible. Here we say that the spin can either 
be added to or subtracted from the orbital angular momentum. 

We shall now consider the reduction of the vector space x 3~ y , that is, 
we shall explicitly identify the subspaces and eigenvectors of 3T j x XT y which 
correspond to the irreducible parts. According to VII (3.35) the basis vectors 
of -T) and &~y may be chosen as follows: 


U m = 


J(j + m)! (J - m )! 


and V m . = 




VC/' + m')l (f — m')l’ 


( 10 . 6 ) 


where v+, and u+, «_ correspond to the same representation .® 1/2 . We 
then expand the expression 


Kj =(u + n_ -u^v + y +j '- J (u + x + u^y) j - r+J (v + x+v^yY-- i+J (10.7) 
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in powers of x and y. The coefficients of 

j + M y j — M 

Xm ~ V(7+MjK7^M)! (1 °‘ 8) 

will, as we shall shortly prove, describe the desired linear combinations W J M 
of the U m and V m - (up to a normalization factor) which transform according 
to Si j. Kj may be evaluated in the same way for all J = j + /', j +/ — 1, 

\j - 71 - 

In order to prove this transformation property of the coefficients of X J M in 
Kj, we note that Kj is invariant under the action of the group SU 2 if the x, y 
transform contragradiently (that is, according to the inverse and transposed 
transformation; A'~ l is the contragradient transformation corresponding 
to A ) relative to the u+, Therefore the coefficients of X J M transform 
contragradiently with respect to the X J M . For the invariant (u + x + y) 1J 
we obtain 


2 J 


(u + x + u_y) 2J = £ - fp -- ul x v u 2J v y 2J v 

v =o v! (2 J - v)! 


= ( 2 j)i y _ - _ 


u J ++ M x J + M u J_-M y J-M 


= (2 jy. I 


ui +M u J S M 


m=—j + M)!(J - M)! 


(10.9) 


Therefore we find that the (u’j M u J X M )j\J{j + M)\(J — M)\ transform 
contragradiently with respect to the X J M . Therefore the coefficients of X J M in 
Kj transform like 


U J + M U J-M 

y/(J + M) \ (J - M)!’ 


( 10 . 10 ) 


that is, according to S } . 

In order to determine the coefficients of X J M in we need to compute 


(u+ v + y +j '~ J 

(u + x + u_yY~ r+J - 
(u + x + v_yY~ J+J = 


= )”( J +f v ^i + v-y +j ’- J -'(mJv + y, 

E f J + J \u + xy~ r+J - ii (u^yy, 

m=o \ A 1 / 

J £ - J p +j yv + xy(v-yy-> +j -'. 
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Instead of p and p we shall use the summation variables m = j 
m' = p + v — /. We then obtain 


= 1 Z (- 1 )' 


Jj+f -A(j-f + A(f ~j + J 


m 


— v + m 


U-i ' m \p m V'i ~ m X J + + < m + m ) 


v — p and 


( 10 . 11 ) 


= O’ + f - jy u - f + jy. a' -j + jy Y J c J m , m 'U m v m .x J m+m .. 

mm' 

With m + m' = M we obtain 


yj(j + m)!(/ — m)!(/' + m')!(/' — m')!(J + A/)!(J — M)! 
v!(/+ / — J — v)! (/ — v — m)\(J — j' + v + m)\(j' — v + m')\(J— j + v — m')! 


( 10 . 12 ) 

Thus, using the normalization factor pj we obtain 

W J M = pj £ (10.13) 

mm' 

(m + m' — M) 

Pj is computed from the normalization condition 

Ml 2 = i = l/>,l 2 ZlcL'l 2 - 

mm' 

The sum on the right-hand side must be independent of M because \pj\ 2 
cannot depend on M. We will now consider the case in which M = J. Then 
for m + m' = J (noting that in (j — m — v)! v < j — m must be satisfied; in 

(J — j — m’ + v)! = (— j + m + v)! we must have j < m + v) the only 

remaining term in the sum is given by 

_ r 1v - m y/m l(j + my.(f + m’y 

m ’ m K ' (J ~ f + jy (J + f - 7)! V O' — — m’)\ 


= (-i y~ m 

Therefore we obtain 

Z l<m '| 2 

m, m’ 

(m + m' =J) 

(2j)!(-iy ,+ '-j y // — j - j - 1 \// —/ - j — 1 \ 

(J-j 1 +j)'(j+f-jy mi \ f-m’ A j-m y 

(m + m’ =J) 

where we have used the relation 



(2 jy 


(J -/ + 7 )! (J +/-;)! V 


j + m\ j’ + m! 


m 


J-m 
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(l + z)' = XQz v and (l+zY = Y[iy 


it follows that 


(1 + zX +s = X ( r + s y = (1 + z)'(i + zy = E 

T \ z / V,H 


Therefore we obtain 


.T)- .5 (:)(;) 

(V + /J = t) 


Using this fact we obtain 


E K« 


( 2 J)!(- 1 ) 


m,m 

(m + m' = J) 


(j-f +j )UJ + j 


* r+j - J / — 2 J — 2 \ 
'+/-;)! y + f — J/ 


: 2 t il-' : J ; ; 


j+r-j. 


j +f - j 


it follows that 


E l<m'| 2 = 


(/+/ +J + 1 )! 


m, m 

(m + m' = J) 


(J - f +j)\(J+ f - j )! 0 " + / - J)! (2J + 1) 


(2J + 1)(J + / - j)! (J - / + j)! 0' + j ~ jy. 


(j+f + J + 1)! 


Thus we finally obtain 


W J M = E 


(10.14) 


m,m 

(m + m' = M) 


with (for the case in which m + m' = M ) 


di = 


yj 0 + / + T + 1 )! 

■ V(2J + 1 )(J + / - j) l(J+f +j) ! o + / - J) HT+W 

(j - m) ! (/ + m')\ (/ - m')! ( J + M)! (J - M)! 


■ E (~D V 


v! O' + / - ■/ - v )!\j - v - m)\ (J - f + v + m)\ 

(/ — v + m) \ (J — f + v — m)l 

(10.15) 
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The coefficients d J m m . are called the ClebschGordon coefficients. The d J m m . 
determine a unitary transformation for the transition between the basis 
consisting of the products u m v m ■ to the W J M as follows: 

W J M = £ U m V m'<Xm.m-,MJ, (10.16) 

m, m' 

where the unitary matrix is given by 

, m + m'dm, m -■ (10.17) 

Since the are real, it easily follows that the inverse is given by 

(10.18) 

and we therefore obtain 

= X (10.19) 

J = m + m' 

For j = 1 and / = \ we obtain the special case 

TT 

l ; 

( 10 . 20 ) 


If 3~y are subspaces of the Hilbert spaces corresponding to physical 
systems 1 and 2 with bases u m , v m ., respectively, then the W J M are not only 
eigenfunctions of the square of the total angular momentum with eigenvalue 
J(J + 1) and are eigenfunctions of the third component of the total angular 
momentum with eigenvalue M but are also eigenfunctions of the square of 
the angular momentum of both system 1 and 2 with eigenvalues j(j + 1 ) and 
/0' +0, respectively. They are not, however, eigenfunctions of the third 
component of angular momentum for systems 1 and 2. This results from the 
fact that the square of the angular momentum of systems 1 and 2 transform 
like a scalar quantity under rotations in x and therefore commute with 
the operators of the total angular momentum; this is not the case for the 
individual components of angular momentum for systems 1 and 2. 

We shall denote the total angular momentum quantum number for one 
electron by j. Thus we find that 
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is a vector in x which is an eigenvector of both J 2 and J 3 with eigen¬ 
values j(J + 1) where j = 1 + j and M = m + j, respectively, and of L 2 with 
eigenvalue 1(1+ 1) and of S 2 with eigenvalue |(^ + 1 ). The vector 



l — m + 1 
21 + 3 


ri+ 1 


U + + 


l + m + 2 


21 + 3 




( 10 . 22 ) 


also corresponds to the same eigenvalues j(J +1) and M of the operators 
J 2 and J 3 . All vectors of eigenvalue j(J + 1) of J 2 and M of J 3 are obtained 
by taking the sum of ( 10 . 21 ) and ( 10 . 22 ) with arbitrary /(r) and g(r). 


11 Fine Structure of Hydrogen and Alkali Metals 


The fine structure of the spectrum of alkali metal atoms is easier to visualize 
as that of hydrogen. The Hamiltonian operator has the form 


H = H 0b x 1 + H 1 . 


We shall write H(A) = H ob x 1 + XH x and if we let A increase from 0 to 1, 
we will obtain the following behavior for the eigenvalues: 

For A = 0, H commutes not only with U D but also with V D x 1 (and 
1 x R d ). For the eigenvalue s nl the corresponding eigenspace of -+C b x is 
given by t„, x t s where i nl is spanned by the eigenvectors (p„i(r)Y l m (9, cp), 
m = —l, —1+1,..., 1. The representation corresponding to rotations 
described by V D x 1 in t nl x + is given by the representation described by 
U D in 4 „, x i s is ^, + i /2 + for / # 0 and 0 1/2 for / = 0. Since, for 

0 < A < 1 H commutes with U D the perturbation H l splits each terme„, into 
two terms (for l # 0) corresponding to the total angular momentum quantum 
numbers j = l + \ and j = l — j. The terms e n0 cannot be split. According to 
§7 in the first approximation oi perturbation calculation the correct linear 
combination for; = l + \ is obtained from ( 10 . 21 ) by making the substitution 
f(r) = (p nl (r). We obtain 


(Pm(r) 


\l + m + 1 
' 21+1 


YLu + + 


l 


m 


' 21+1 




For j = / — j we obtain 


( 11 . 1 ) 



/ 


m 


' 21+1 


YLu., 


+ 


l + m + 1 
21 + 1 



( 11 . 2 ) 


In order to estimate the magnitude of the splitting we rpust know what H t is. 
Each vector (j) from l^xi, may be written in the form 4> = ( P+ U + + <P -U~ 
where<p + andcp^ are vectors in We therefore obtain H!<pn + =if/ + + u + + 

i/r_and H l cpu~ = i j/ + -u + + The maps + + + , 

cp -> cp -* represent linear operators in which we shall denote 
by K + + cp = i l/ + + ,K^ + cp = i^_ + ,etc. Thus we find that H x = x 

E Vfl (v, g are to be summed over + and — as indices) where E vll u f> = S pil u v . 
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The E V)t may therefore be represented by the four linearly independent 
operators (matrices) 1 , S u S 2 , S 3 . Therefore we obtain 

3 

H, = A x 1 + £ B k x S k . (11.3) 

k— 1 

Since, under rotations the S k transform like an (axial) vector, and //, must 
be a scalar, it follows that A must be a scalar and that the R k must be com¬ 
ponents of a vector. Since H l and the S k are self-adjoint, A and the B k , as 
operators in ^ must also be self-adjoint. 

According to VIII (5.8) H 1 has the following form: 

H, = — —P 4 + S -(E x P) + div E. (11.4) 
1 8m 3 2 m 2 v ' 8m 2 v ’ 


Since eE = grad V{r) where e is the (positive) elementary charge and V(r) 
is the potential energy of the electron it follows that (for atoms with no 
external field): 


A = + A AF(r), 

8m 3 8m 2 v 


Bk = 2m 2 ( grad V(r) X ^ k ' 


(11.5) 


Since grad V(r) = V'(r)(fjr) and r x P — L, it follows that 


B k 


1 V'(f) 

2 m 2 r 


( 11 . 6 ) 


According to previous results it follows that x t s = <t l+1/2 © 1/2 where 

Of 4. 1/2 and 1/2 are spanned by the vectors (11.1) and (11.2), respectively. 
For the vectors % in <>j (J = / + \ and j = l — ?) from J 2 = (L + S) 2 = 
L 2 + S 2 + 2L- S, J 2 x = J(J + 1)Z. L 2 x = 1(1 + l)x and S 2 x = s(s + l)x 
(with s = j as the spin angular momentum) we obtain 

2L ■ S X = DO + 1) - Kl + 1) - s(s + l)]y. (11.7) 


From (11.3) to (11.6), for a first approximation, we therefore obtain: 

E nlj ~ e nl + C nl + jD nl \J(j + 1) — 1(1 + 1) — S(S + 1)], (11.8) 

where C„, is obtained from A and the D nl are obtained from B k . D nl is given by 
the expression: 

i r 00 

D m = 2 ^ j o V'(r)r\(p nl (r)\ 2 dr. (11.9) 

The separation of the two terms j = l + j, j = l — \ is therefore given by 

£ nf!+l/2 — e nll- 1/2 = B nl (l + ^). (11.10) 

Since V(r) > 0, the energy eigenvalues increase with increasing j. This is 
intuitively evident since the term YjkB k x S k has the form (1/2 m 2 )(V'(r)/r)L-S. 
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According to the perturbation computation procedure for e„, we replace 
V'(r)/r by its mean value (expectation value) 

vTr) f 00 

—— = V'(r)r\(p nl (r)\ 2 dr 

’ j o 

and we obtain 

1 V'(r) -- 

— 7 — -L-S = D nl LS. (11.11) 

2 m 2 r v 

The angular momenta L and S are therefore coupled with the energy (11.11) 
where the latter is a maximum when L and S have the same direction. 

The splitting of an alkali metal spectrum by the spin is exhibited by Figure 
11. The terms for which l = 0 are not split. According to convention, the 
terms are denoted by S j; Pj,... , etc., where S, P correspond to the orbital 
angular momentum /. 
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For the hydrogen atoms (and corresponding ions) the perturbation energy 
is given by H — A x 1 + CL ■ S where 


A = 


C = 


^ p4 + ^ 4nSm 


Ze 2 


( 11 . 12 ) 


2 m 2 r 3 ' 

The eigenfunctions corresponding to the eigenvalue e„ = -me 4 Z 2 /2n 2 are 
given by 

A- 1/2 


XnijM = (Pni(rW l M 112 and i„ m = <p nt (r)W l M , 


(11.13) 


where j = / + j or / — j and the W J M are given by the expressions in the 
brackets in (11.1) and (11.2). We obtain 


(nljM | H i | n'l'j'M'} = - — 3 <nljM | P 4 1 nl'j'M'} 


1 _ 
n : 
Ze 2 


+ ^~2 <Pno(0)<5(0 


+ ■ 


8 m 2 

Ze 2 


\ nljM 


nl'j'M' 


2m 2 c 2 V 
iL/0 + 1) — 1(1 + 1) — s(s + 1)], 

where the last term contributes only for the case l # 0. Since 

±P*- Ze l 
2m r 


nljM ) _ e n d m 'd w djj'd MM ' 


n'l'j’M' ). 


nljM 

\ 

we therefore obtain 

<nljM\P 2 \nl'j'M'y = 2me„d m ,d ll ,5 JJ ,d MM , + 2mZe 2 (nljM 
Using the following notation 

/*00 

7"' = r v \(p nl (r)\ 2 r 2 dr 
Jo 

from 

<n/;M|P 4 |n/'/M'> = £ (nljM \ P 2 \ n"l"j"M"j(n"l"j"M" \ P 2 \ nl’j'M'} 

we finally obtain (since s = ■£) 

(nljM | H x | nl'j'M'} 

= S w 8jj,S MM , 

Ze 2 


i / r' i" 

2m ( e « + 2Ze 2 e„ - +ZV ? 


Ze 2 1 nl 

+ g^2 % 2 o(0)^m^i/2 + “3 O’O' + 1) - Kl + 1) - |) 
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n=3 


3d 
3d':. 
3p > 
3p~. 
3s " 


[ 2P" 


n=z 




l 

z 

Z-2 
~~ 1 
1 

"0 


J 

5/za 
3/2 b 

Z2C 


7%/ 


2s Earn EC Ialblc 0 


VzE 


Al=±l A]= 0,tl 

Figure 12 Fine structure of the line n — 3 -» n = 2 of the hydrogen atom. 


From 


we obtain 


I" ! mZe 2 1"' m 2 Z 2 e 4 

r ~~7~’ ? ~ n 3 (l + |) ; 

I nl m 3 Z 3 e 6 

7 ~ 7(1 +!)(/ + i)l 


(riljM | Hj | ril'j'M’ > = 8 lv d jr S MM , 

r l mZV / 1 3 Q-Q-+!) + /(/+1) + |) \ 

'[ 2 n 3 \/ + i An /(/ +!)(/ + i) ) 


1 Zc^ "1 

+ g ^^ 2 o(°)^o^i /2 • 

For j = l + j and for <p„ o (0) = (l/ x / / 2)(2 mZe 2 /n) 312 we obtain, in first 
approximation, the fine structure formula 


e nlj ~ 



Z 2 e A 



(11.14) 


In Figure 12 we exhibit the theoretical values for the line n = 3-»« = 2of 
the hydrogen atom. 








CHAPTER XII 


Spectrum of Two-Electron Systems 


1 The Hilbert Space and the Hamiltonian Operator for the 
Internal Motion of Atoms with n Electrons 


According to VIII, §4 the Hilbert space for an atom having n electrons has 
the form $ b x {M' n \_ where is the Hilbert space of the atomic nucleus 
(considered as an elementary system) and J’f = x t s is the Hilbert space 
for a single electron. 

In direct analogy withy III, §2 we will now introduce center of mass and 
relative coordinates. Let P and the p (k) denote the momentum operator of the 
nucleus and the kth electron, respectively. Similarly, let Q 'and q (k) denote the 
position vector of the nucleus and the kth electron. Let M and m denote the 
mass of the nucleus and the electron, respectively. We define 


Qs = 


MQ + m q (k) 
M + nm 


Q, 




_ 


Q- 


Similarly, for the momentum P s and p (k) we introduce 


P = 


M 


M + nm 


1p\ 




p(k) = pile) + 


1 


M + m 


P*. 


Then the P s , Q s and p (k \ q ik) satisfy the same commutation relations as the 
P, Q and the p <k \ q (k> . Furthermore, we find that 


J_ p2 + y S <*> 2 = - 

2 M 2m r 2 (M 
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Since M is large compared to m, we can set 

1 1 1 
- _j_ — _ 

M m m 


(1.2) 


Thus, from (1.1) it follows that the Hilbert space of an atom with n electrons 
is of the form Jf s x jP t where describes the center of mass motion and 
J^i = where J^ r = M' br x t s . Therefore has the same structure 

as the Hilbert space for n electrons, except that is replaced by J? hr , 
that is, by the orbit space in relative coordinates. From (1.2) it follows that 
we obtain a good approximation if we do not modify the mass of the electron 
in the Hamiltonian H t for the inner motion of the electrons. 

We may therefore treat an atom with n electrons in the same way as an n 
electron system, at least with respect to the inner motion. 

We obtain the Hamiltonian operator for the inner motion from VIII 
(5.8); here Z denotes the charge number of the nucleus, r ; denotes the distance 
of the ith electron from the nucleus and r ik denotes the distance from the 
ith to kth electron. The Hamiltonian operator is given by: 


H = 


n « 2 


2m 


" Ze 2 1 v e 2 „ 

X ^r + 2 £ 7~ + 

k=l r k z i,k r ik 

(i*k) 


(1.3) 


where H 1 is a correction term obtained from spin and relativistic effects 
which we shall examine in more detail in XIV, §2. Instead of (1.3) we shall 
consider a more general operator 


= — Z P {k): 

2m 


+ t y (r k ) + \ I - + H x 

k= 1 z i,k r ik 

(i*k) 


(1.4) 


in order to leave the possibility open to describe the alkali metal spectra 
by using a “mean” field (see XIV, §§3 and 9). 

If we identify from VIII, §1 with the Hilbert space of n electrons and 
we identify Hi from VIII, §1 with H in (1.3) or (1.4) then we may make use 
of the general results of VIII, §1. Then we have to identify the representation 
operators jR(^4) from VIII, §1 with the representation operators V D in 
for rotations about the nucleus. 

According to VIII (1.7) may be decomposed as follows: 


= Z © (1-5) 

j 

where is the eigenspace of J 2 (where J is the total angular momentum 
about the nucleus) corresponding to the eigenvalue j(J + 1). Each M' j 
may again, according to VIII (1.8) be written as follows: 


Jf J = P x P, 


( 1 . 6 ) 


where the representation operators V D in jP j may be written in the form 
V D x 1 and the V D in P corresponds to the irreducible representation S>j. Ac¬ 
cording to VIII, §1 the .jP j are invariant subspaces of the Hamiltonian 
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operator H ; according to (1.6) H has the form 1 x H J . Therefore the spectrum 
of H will be known if the spectrum of the H J are known. Chapters XII-XIV 
are, for the most part, concerned with finding the H j and examining their 
structure. 


2 The Spectrum of Two-Electron Atoms 

In the helium atom two electrons move in the field of a doubly charged 
nucleus. In order to take into account the corresponding ionized atoms of 
subsequent atoms in the periodic table we shall represent the nuclear charge 
by Ze. The Hamiltonian operator is given by: 

+ (2.D 

Since, from the case of the hydrogen atom and the alkali metal atoms, we 
already know that the influence of the spin is small, we shall first neglect the 
effect of //] and investigate the eigenvalues of H 0 . 

The Hilbert space for the two electrons is therefore {Jf 2 }_. Clearly 
2 = l x t 2 . In order to decompose the representation of the symmetric 
group S 2 in Jf 2 into irreducible parts, that is, in order to obtain {Jf 2 }_ we 
shall proceed in two steps as follows: 

First we reduce the representation of S 2 in 3df b and in i 2 s separately (where 
the permutation operators P in and i 2 correspond to that defined in 
2 ). Clearly S 2 has two representations—the symmetric and the anti¬ 
symmetric. Thus we obtain 3? b = {Z( J b \ + © and t 2 = {i 2 } + 

© {-? 2 }. Thus it follows that: 

Jf 2 = {^ 2 } + x {i 2 } + © pf fc 2 } + x R 2 }_ © pf> 2 }_ 
x {i 2 } + © {Jf fc 2 }_ x K 2 }_. 

It is easy to show that 

{jf 2 } + = {^f 6 2 } + x K 2 } + © pf 6 2 }_ x K 2 }_, 

= {je 2 b } + X K 2 }- © x K 2 } + . 

According to §1 the Hilbert space {Jf 2 }_ is to be used for the two electrons. 
Since H 0 does not act upon the vectors of t 2 and commutes with permuta¬ 
tions in 3&1, H 0 transforms the vectors of and {3^ b }_ into the 

subspaces {Jf b } + an d {J ^ b }-, respectively; therefore the same is true for 
vectors of {J^ b } + x {-& 2 }_ and {Jf b }- x {i 2 } + . We may therefore consider 
the spectrum of H 0 separately in {Jtf’ b }+ and {:Zf b }- . 

A precise computation of the eigenvalues of H 0 is not possible. For this 
reason we shall investigate H 0 using the method of perturbations as follows: 

H 0 = H 0 ( 1) + H 0 ( 2) + AW, (2.4) 


( 2 . 2 ) 

(2.3) 
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where 


H 0 (i) = 


1 A2 

— P- - 

2m 1 r. 


and 



(where we may think of A as increasing from 0 to 1; see XI, §7). 

The eigenvalues of H 0 ( 1) + H 0 ( 2) in are easy to determine. If <£ v ( 1) 
is the eigenvector of H 0 ( 1) in J^ b with eigenvalue e v where v replaces the three 
indices n, l, m and <f> v is the hydrogen eigenfunction XI (4.7) corresponding 
to the eigenvalues s v = e„, m = e„ = -R/n 2 then the </> v (l)(^(2) are eigen¬ 
vectors of H 0 ( 1) + H 0 (2) in with eigenvalue e V/I = s v + e„; (fi/l)<f> v (2) 
correspond to the same eigenvalue. In {Jf£}+ the vectors 


<f> v (l)(t>v(2) and 


1 

V2 


[<p v (l)M 2 ) + 0„(W*(2)] 


(2.5) 


form a complete system of eigenfunctions; similarly 


[<!>*(VM 2 ) ~ ^O)0v( 2 )] 


(2.6) 


is also such for The eigenvalues £ v + £ fl = £„, + £„ r are multiply 

degenerate. The lowest eigenvalue is 2e 1 , twice the ground state term for the 
helium ion. The values £ x + &„■ increase until the continuous spectrum begins 
at (that is, e„, = 0). The discrete eigenvalue jq + e 2 — —2R/4 — —R/2 is 
larger than the starting value for the continuous spectrum. Under the per¬ 
turbation W these discrete eigenvalues + £„(«> 2) become lost in the 
continuous spectrum; nevertheless the physical significance of these discrete 
eigenvalues will become evident in other cases which we shall encounter later 
(anomalous terms, see XIV, §4). If we are interested only in the structure of 
the discrete spectrum of H we may then expect that the eigenvalues of H for A 
continuously increasing from 0 to 1 arise from the eigenvalues e 2 + e n of 
H 0 (l) + II 0 (2). [For the question whether the eigenvalues change con¬ 
tinuously with A see XI, §§6 and 7 and [10].] The lowest eigenvalue 2 e, occurs 
only in {Jf£}+ for the eigenvector <^> 1 (1)<^> 1 (2). The remaining £ x + e n (n # 1) 
are found in both {j |?£} + and and correspond to the eigenvectors 


dnlm — —f= [<^i(1)</)„j m (2) + 0„im(l)0i(2)] 

V2 


and 


(2.7) 


2nlm = -7=W>l(l - ( Pnlm0)<t>i(2)l 

x/2 


( 2 . 8 ) 


respectively. The i p nlm , as well as the y nlm , belong to the representation @> l of 
the rotation group if we define U D by means of U D (\) x U D ( 2), where U D ( 1) 
and U D ( 2) act only on vectors in i^( 1) and J%( 2), respectively. The total 
angular momentum is therefore given by the quantum number / for one of 
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the electrons. For increasing A we therefore expect a splitting of the eigen¬ 
value Ej + e„, the effect of which will be different in {3^1} + and {Jff 
The i l/ nlm and the y ntm must be the correct linear combinations, first because 
W transforms only the vectors of the subspaces {J4?j;} + and {M'D- into 
themselves and second, W commutes with U D . Thus F n WF n (where F n is a pro¬ 
jection operator in the subspace of ^„ im or y nlm for fixed n (/ = 0,1,..., n - 1)) 
transforms the vectors i jj nlm and y nlm into themselves up to a factor (for a proof 
of this result see XI, §7). Because H 0 commutes with U D the energy eigenvalues 
do not depend upon m and we obtain 

+ £„ + 4 and % + £„ + £„;, (2.9) 

where 4 = W^ nlm ) and 4 = <y„ (m , Wy nlm ). For <f> nlm = (p nl (r)Y l m (6, cp) 
and for the special case in which 4 = (pj(r)Y° we therefore obtain 

4 = c m + A m and 4 = C nl - A nl , (2.10) 

where 

C„i = e 2 f — I0i(ri)I 2 I<4m(r 2 ) I 2 d 3 r 1 d 3 r 2 
J r 12 

and 

Anl = e 2 f — 4>i(ri)(l>nim(r2)<l>i(f2)^nim(fi) ^ 3 ?i d 3 r 2 . 

J r 12 

Intuitively C nl represents the Coulomb interaction energy of two charge 
clouds of charge density p x = e|</>j j 2 and p 2 = e\4> 2 \ 2 . A„, is called the 
exchange integral. The energy eigenvalues so obtained are quantitatively very 
poor. Essentially better results can be obtained with the aid of the Ritz 
variation principle, where the latter is described in the next section. 

The qualitative results of the previous discussion are as follows: The term 
schemes are decomposed into two classes; the class to which a given eigen¬ 
vector is assigned depends on whether it belongs to either or {Jft}- 

(see Figure 13). The lowest energy value lies in + ; there is no corres¬ 
ponding equivalent eigenvalue in because we cannot construct an 

antisymmetric function from two identical one-electron functions. For all 
other terms there exists two equivalent eigenvalues in {3^1}+ and 
which differ by the exchange integral. Since A„, > 0 (for more about the 
exchange integral see §3) the terms of lie somewhat lower than those 

in pf£}+. 

In summary, we may easily obtain the following overview of the terms by 
means of the following construction principle: Without the mutual inter¬ 
action of the electrons we could consider a term to be the sum of two-electron 
terms. Here it is customary to introduce the following notation: We shall 
denote a state consisting of an electron with principle quantum number n, 
angular momentum quantum number l = 0 (that is, s), and a second electron 
with principle quantum number n' and angular momentum quantum number 
l — 1 (that is, p) by ns n'p. By “switching” the interaction we may therefore 
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think of the discrete terms of helium as developing from the “electron con¬ 
figurations” Is ns, Is np, 1 snd,... . Each of these electron configuration terms 
(up to the ground state terms Is Is) is split by the interaction into two terms, 
a “symmetric” and an “antisymmetric.” For the electron configuration Is Is 
of the ground state we also write (Is) 2 . This term is symmetric and does not 
split any further. We will denote the total angular momentum of a term by L. 
For the helium term L is equal to the angular momentum number / of the 
“optical electron” which is not in the state Is. For the terms L = 0,1, 2,... we 
write S, P, D ,.... For (Is) 2 we only obtain a symmetric S term, from Is np we 
obtain two—a symmetric and an antisymmetric P term. 


3 Ritz Variational Principle 

We may use the Ritz variational principle in order to obtain better numerical 
estimates for the eigenvalues. Solving the equation H(p = Xq> is equivalent to 
finding extreme values of (jp, Hep) subject to the constraint <<p, <p) = 1. If, 
for example, Ax < A 2 < • • • < are the n lowest eigenvalues of 
H(Hcp v = \cp v ), each of which is counted according to its degree of de¬ 
generacy, then H' = H — £” =1 A v P Vv satisfies the property (cp, HUp) > 
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A„<«p, cp) for (cp, <;o v ) = 0 and H'(p„ — 0, v = 1,..., n, a result which im¬ 
mediately follows from the complete spectral representation of H. If 
< (p , cp} = 1 we therefore obtain (cp, Hep} = Z"=, A v | <<p v , <p> | 2 + <<p, #<p> > 
A,. The minimum Aj of (<p, Hep} will be attained for cp = ep v From the 
additional condition (cp,cp l ') = 0 we obtain (cp, Hep ) > A 2 , A 2 will be 
attained for <p = rp 2 ; etc. 

The method of the Ritz variation problem permits another basis for the 
perturbation computation described in XI, §7. Let H = H 0 + H v The 
eigenvalues and the eigenvectors of H 0 may be known. Two eigenvalues 
e x and e p of H 0 may be relatively closely spaced. We apply the variation 
principle as follows: (cp, Hep} is to be minimized subject to the constraint 
that (cp, (p) = 1 and <p is orthogonal to all eigenvectors of H with lower 
eigenvalues. Instead of seeking cp in the general case, that is, finding an exact 
solution of the problem, we shall attempt an approximate solution as follows: 
cp = Z"=i •Av^v where the iA v is a complete orthonormal basis in the space 
i a + ip (i x is the eigenspace corresponding to the eigenvalue e x of H with the 
basis \j/ 15 1 // 2 ,..., ; similarly, ip is the eigenspace corresponding to the basis 
«A„ a +1 ,..., iA„). The x v are free parameters which are chosen by making 
< cp , Hep > an extreme value. The chosen cp are orthogonal to the eigenvectors 
of zeroth approximation for the other eigenvalues of H 0 . On the basis of this 
assumption the cp are clearly only approximate solutions. Setting 

<^ V ,H 1 ^> = (vI^Iai) 


we seek the extremum of 

4 E l*vl 2 + Z l x vl 2 + Z x v<v|Hi|/t)x„ (3.1) 

v=l v=/i a +l v, n = l 


subject to the constraint Z"=il x vl 2 = L If we require that each of the 
eigenvectors (xx„) is orhtogonal to the preceding eigenvectors, then 
the solution of the above problem is a series of extreme values e ( 1 1) , 4' >, ■ ■ ■, s” * 
which are the eigenvalues for the eigenvalue problem: 

(Sa,/i-4 1) X+ Z < V |ff 1 = 0> (3-2) 

where e x or e p are chosen depending on whether v < n x or v > n x . The 
e 1 ) 1 ’, e 2 , 4° are approximate eigenvalues into which the eigenvalues 
e x ,Bp pass by the influence of the perturbation. Equation (3.2) is identical with 
the secular equation described in XI, 6 which is used to find the eigenvalues 
of an operator E v HE v (from the sum in XI (6.1)). 

In order to obtain a more precise computation for the ground state of 
helium we may seek to obtain the minimum of 


M = 



H 0 ( 1) + H 0 ( 2) + — 
r 12 



TPuJic. 
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subject to the constraint <<p, <p> = 1 by assuming that cp — <jf>,(l)</> 2 (2). 
Using the method of Lagrange multipliers X we may take into account the 
constraint as follows: 


<^ 1 (W 2 (2),ffo(lW 1 (l)02(2)> 

+ <^ 1 (l)</) 2 (2) ) H 0 (2)</) 1 (l)</» 2 (2)> 

+ ^(1)^(2), 

- A<^) 1 (1)^ 2 (2), </>i( 1)</> 2 (2)> = extremum. (3.3) 

If we replace (f>k by <t> k + d<p k and then replace d<f> k by iS<j) k , then, from the 
condition that the variation of (3.3) must vanish for arbitrary Scj) k> we obtain 
the following pair of equations: 

tfoOWhO) + <0 2 (2), H o (2)02(2)>0i(1) 

+ U 2 (2),^-0 2 (2)V 1 (1) = 20,(1), 

12 ' (3.4) 

<^,(1), ja r o (i)0 1 (i)>0 2 (2) + 

+ (^(i)k-f(i)W)=w) 


with the extreme value A of <</>,(l)</> 2 (2), //<j>,(l)<j> 2 (2)>. 

If we set (f) 1 = 0 2 = (j) for the ground state of helium (since the latter is 
symmetric) we obtain 

Ho(l)<Kl) + /<K2), </>(2)^(l) = (3.5) 

where e = 2 — <</>(2), H 0 (2)cfi(2)}. This is, however, the eigenvalue equation 
of an electron with the potential energy —Ze 2 /r l — eV(r k ) where 

VCrd=-{m t ^m^ (3.6) 

is equal to the potential of the charge density — e \ cp( 2) | 2 . For the ground state 
we can assume that (f> has spherical symmetry; then (3.6) will depend only 
on fq. Above all, in order to compute V(r) the quantity <p must already be 
known. It is possible to solve (3.5) by successive approximation where the 
V(r) from the previous approximation is used to compute the current 
approximation using (3.4). This method, known as the Hartree method of 
“selfconsistent” fields is applicable to cases involving more than two electrons. 

We will now briefly exhibit the above procedure for the case of the ground 
state of helium. For the variational problem we shall seek the extremum of 
<<£, [flo(0) + H 0 (2) + e 2 /r 12 ](/?>■ We select the special form 

oc 2 

<p = — (3.7) 
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Here only a remains to be chosen. We therefore obtain the variational 
problem 




+ fe 2 a = extremum. 


The derivative with respect to a must vanish; thus we obtain a = me 2 (Z — fg) 
and a minimum value e = - me 4 (Z — ^) 2 . 

From this result we may easily compute the ionization energy of helium. 
The latter is the difference in energy between the helium atom and singly 
ionized helium. According to XI (3.14) the energy of the helium ion is equal 
to — (me 4 /2)Z 2 . The iodization energy is equal to 


me 4 [(2 - -j^) 2 — j2 2 ] = 1.695 Rydberg. 


The corresponding experimental value is 1.810 Rydberg = 24.46 eV. In spite 
of the simple assumption the agreement is reasonably good. If we completely 
neglect the interaction of the electrons then we would obtain a value of 
E = — me*Z 2 . We shall now give the following intuitive interpretation of the 
above results: the motion of each electron is such that the charge of the 
nucleus Ze appears to be shielded by the other electron such that its effective 
value is (Z — Y^)e. 

For excited states we must require that (f> t and <f> 2 are different, in order 
that both equations (3.4) may be simultaneously solved. The fact that one 
electron must be at a greater distance from the nucleus than the other suggests 
that, for all practical purposes, we may expect that the eigenfunction </>,(!) 
will be equal to that of the ground state of the helium ion. |<£ 2 (2)| 2 will then 
be essentially nonzero where (/>,(!) « 0. For this reason we may set 
<</> 2 (2), (e 2 /r 12 )</> 2 (2)) in the first equation of (3.4) equal to a constant 
<^ 2 (2), (e 2 /r 2 )<£ 2 (2)). We therefore obtain 


H o(l)0i(l) 


A - <<£ 2 (2), H 0 (2)4> 2 (2)> - 


(M2), -^( 2 ) 

\ r 2 


<t> i(D- (3.8) 


The solution is given by 

1) = (mZe 2 ) 3/ V mZe2 ’-‘ (3.9) 

where 


A - <<£ 2 (2), H 0 (2)</> 2 (2)> - ( M2), - M2) 


Applying equations (3.4) to (j) 2 { 2) we obtain 


= e l = 




M2) - a + 


mZ 2 e 4 
2 


(3.10) 


which is an eigenvalue equation corresponding to an electron in a potential 
field 


7 e 1 

nr,) - - -f + 

r 2 


— <Mi) / = 
r 12 


(Z - l)e 2 


+ R(r 2 ). (3.11) 
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The expression within the bracket depends only on r 2 because |<£i(l)| 2 is 
a spherically symmetric charge distribution. R(r 2 ) is, for all practical pur¬ 
poses, equal to zero for distances greater than (mZe 2 ) -1 . For the 
(A + mZ 2 e 4 / 2) and <£ 2 (2) we therefore obtain eigenvalues e n( and eigen¬ 
vectors <p„ lm in the same manner as we have already discussed for the case of 
the alkali metals spectrum. We may omit the term n = 1 because, in the 
orthogonality condition formulated above we have found that ^ 1 (1)^ 2 (2) 
is orthogonal to the ground state (3.7); as an approximation we may replace 
this condition by requiring that c/> 2 (2) must be orthogonal to (3.9). The energy 
eigenvalue obtained for the excited state is approximately given by A = 
e„, — mZ 2 e 4 /2 = e„, + £,. Therefore we find that the states corresponding to 
different values of l no longer coincide. 

For fixed n and l a particular term can be found in both {Jtf’l} + and {J^£} _ 
because both Q„ + , = </> 1 (l)0„,(2) + </)„,(l)^,(2) as well as Q„1 = <jS( l)</>„,(2) - 
</>„,( 1 )</>i (2) belong to the same eigenvalue. 

We must therefore obtain better values for the energy if we solve the 
extremal problem for <<p, H(p)\\(p\\~ 2 assuming that <p = aQ^ + bQ~ t . 
Since H commutes with the permutation of the electrons the selections 
cp = and cp = Q.~, must yield extremal values. Therefore the energy level 
e„, + Si are therefore split into the energy levels: 

<Q+, HQ+>||Q^r 2 = El + £„, + A+ in pf 2 } + (3.12a) 

and 

<n„l, tfQ„- ( >||Qjr 2 = £„ + G n i - A-, in {Jf 2 }_. (3.12b) 

Since the (/>„, are not exactly orthogonal ||f2„,(|| 2 is not exactly equal to 2. 
If we neglect this discrepancy then A^ = A nl and is equal to the same 
expression as (2.10) except that we need only replace e 1 /r l2 by e 2 /r 12 — 
<</>i(3), (e 2 /r 32 )^i(3)>. We obtain 


A„i = e 2 



fl<r 3 l<Pi>l 2 ~d 3 r 3 
•> r 32 J 

<r 2 1 </>i ><fi I (PnimXr 1 1><r 2 1 (j) nlm > d 3 r l d 3 r 2 . (3.13) 


The exchange integral A nl is responsible for the different values of the energy 
levels in { l } + and {-Af'l} _. 

A„i and —A nl arise from the Coulomb interaction of electrons in the 
different states in {Jf£}+ and {,#?£}.„, respectively, a result which is clearly 
evident from the first equation in (2.10). How does the energy difference 
between these states arise? In (2.10), C nl is the interaction energy corres¬ 
ponding to the description of the two electrons as two interacting continuous 
charge clouds. If we consider a state ^> 1 (1)<^> 2 (2) then the probability that 
electron 1 lies in a volume d 3 r l about r, and electron 2 lies in a volume d 3 r 2 
about f 2 is, in fact, equal to 

K' , il4>i>l 2 |<T2l</>2>l 2 d 3 r t d 3 r 2 . 
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For the state (1/ V /2)(^> 1 (1)^ 2 (2) + 0 j (2)</> 2 ( 1)) the above expression is 
given by 

+ { <f 1 \4> iXr 2 1 1 ><r 1 1 <t> 2 > 

+ <r 2 |<^X^I</>2><ril0i><r2l</>2>}] d 3 r l d%. 

For f, = f 2 , for example, for the state </>,(l)</> 2 (2) the probability is given by 
l<Til<^i>l 2 K r i 10 2 )I 2 d 3 ?i in the second case, however, we obtain 

CK^i l^>i)l 2 K^i I02>l 2 ± K^i I 0 i>l 2 K^i I02>l 2 ] d 3 n d 3 r 2 . 

Therefore either we obtain twice the preceding result or zero. Thus we find 
the position probability of the electrons are correlated, that is, they are 
mutually dependent. Intuitively, in the case pf l } _ the electrons “avoid each 
other’s company”; for the case {Jf0} + the electrons “prefer each other’s 
company.” Thus it is understandable that the Coulomb repulsion for the 
energy levels in pf*} + i s higher than that in _. For this reason the 
exchange integral is often referred to as the Coulomb correlation energy. 


4 The Fine Structure of the Helium Spectrum 


According to (2.3) the Hilbert space for two electrons is given by 

pr 2 }_ = pr 6 2 } + x {*,}_ © pr 2 }_ x p 2 } + . 


In order to exhibit the complete set of eigenfunctions (with spin) for the 
operator 


rr 1 /n2 n2 . Ze 2 Ze 2 e 2 

H ”-5; (P ‘ + P2) ~7r“ + ^ 


we must multiply the eigenvectors in pf 1} + by antisymmetric vectors of 
{0}_ and we must multiply the eigenvectors in by symmetric 

vectors of p 2 } + . 

The space t 2 which is spanned by the vectors u + (l)u + (2), m + (1)u_( 2), 
«_(l)u + (2), u_(l)u_(2), can be easily decomposed into {0}-- and p 2 }_ as 
follows: {00 has the basis 


u + (1)m + ( 2),—p(« + (l)u_(2) + «_(l)«+(2)), u_(l)u_(2). 

V2 

{0}_ is one-dimensional with basis vector 


1 

T* 


(a + (l)u_(2) - n-(l)« + (2)). 


In t 2 the product representation of the rotation group is given by 
3) i /2 x .@ 1,2 = 3 X + 3 0 . The two spin angular momenta can therefore be 
added to obtain a total spin angular momentum of S = 1 or S = 0. In order 
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that the representation operators for the rotation group commute with per¬ 
mutations in 4 2 , they must leave the subspaces {4 2 } + and {4 2 }_ invariant. 
Since 3 l/2 x 3 ia may be decomposed into two irreducible components 3 y 
of dimension 3 and 3 0 of dimension 1, we obtain the representation 3 y in 
K 2 } + and 3 0 in {4 2 }_. 

For an energy level corresponding to an electron configuration consisting 
of one Is electron together with an electron with principle quantum number n 
and orbital angular momentum number l we find that the exact eigenvectors 
of H 0 in {Jfj} + must describe the representation 3 L of the rotation group 
where L = / because the representation will not be affected by “turning on” 
the perturbation. Therefore there exist (2 L +1) exact eigenvectors 

K(M= -L, -L+ 1 1, L) 

in + an d <£m in - , respectively, corresponding to the exact 
eigenvalues; these exact eigenvectors do not take on the approximate form 
(1 /!(1 )^n/ m (2) + <^„ im (l)</>,(2))! Let us denote these eigenspaces from 
by 4 *. Then the complete set of eigenvectors for H 0 will consist of 
the vectors from the space x {4 2 }_ for the symmetric case in and 

the vectors from the space il x {t 2 } + for the antisymmetric case in {-#1} _. 
Since 3 0 is the representation of the rotation group in {t 2 }~, we find that 
3 L x 3 0 = 3 L is the representation in 'i' L x {4 2 }_. The total angular 
momentum for both electrons, including orbital and spin is usually denoted 
by J. Therefore we obtain J = L. Since the representation of the rotation 
group in 4^ x {4 S }_ is irreducible, the symmetric energy levels cannot be 
split by means of the spin dependent (but rotation invariant) term H y 
of the Hamiltonian operator, the symmetric energy levels remain simple. 
The situation is somewhat different for the case 4£ x (4 2 }+. Here we find 
that the representation of the rotation group is 3 L x 3 y = 3 L+y + 3 L + 
3 L -1 for L > 1 and 3 0 x 3 y = 3 y for L = 0, respectively. The total 
angular momentum can therefore take on values J = L + l,L, L—\ and for 
L = 0 only J = 1. Therefore the antisymmetric S term (that is L = 0) 
cannot be split by the spin perturbation. In contrast, the other antisym¬ 
metric energy levels are split into three fine structure energy levels corres¬ 
ponding to the values J = L — 1 , L,L+ 1. 

The number 2S + 1 is called the multiplicity and is written above and to 
the left of the term symbol. For S = 0, 1 P, is therefore a “singlet” P term 
(that is L = 1) with total angular momentum J = 1. For example, for S = 1, 
3 P 2 is a “triplet” P term with total angular momentum J = 2. For helium 
the three terms 3 P 2 , 3 P 1 , 3 P 0 lie closely spaced. Since the symmetric terms 
are, according to the Pauli exclusion principle, associated with S = 0 and the 
antisymmetric terms are associated with S = 1, it is not necessary to provide 
a characterization of the terms according to its symmetry under permutation. 
Although an antisymmetric ^ term (that is, L = 0) is not split, that is, it 
has only one component 3 S t , we still call it a triplet term corresponding to 
2S + 1. The number 2 S + 1 above the term symbol on the left therefore 
specifies both the total spin as well as the symmetry of the vectors in J(?I. 
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By introducing the perturbation H 1 we find that the spin angular 
momentum and the symmetry of the position functions from (Jf£} + and 
{■#?$}- are exactly defined only for eigenvectors of the zeroth approximation 
(that is, for the “correct” linear combinations). The stronger the influence 
of H u the greater the deviation of the exact eigenfunctions from the linear 
combinations of the zeroth approximation. The value J, that is, the repre¬ 
sentation Qij cannot, of course, be changed (see XI, §7). 

In order to obtain a quantitative estimate of the magnitude of the splitting 
it is necessary to know the form of H 1 . Since we do not wish to make any 
explicit quantitative computation of these effects, we shall delay any such 
discussion about the displacement between the individual fine structure 
components until we have discussed the case for more than two electrons 
and the associated multiplicity 2S + 1. 
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CHAPTER XIII 


Selection Rules and the Intensity of 
Spectral Lines 


1 Intensity of Spectral Lines 


According to XI (1.17) the intensity of spectral lines is determined by the 
effect operator for the emission of a photon of frequency c o„ m as follows: 

4(o 3 

^F„^ m = At E n d(Q) ■ E m d(0)E n . (1.1) 

The probability for the emission of a photon in the time interval At for an 
ensemble described by the operator W is given by 

4cft 3 _. _. 

At-^tT(WE n d(0)-E m d(0)E n ). (1.2) 


Since El = E„ we find that 

tr(WE n d(0) ■ E m d(Q)E n ) = tr(£„ WE n d{ 0) • E m d(0)E n ). 

We therefore find that we need only be concerned with the components 
W„ = E„ WE„ of W. We shall often find that W„ is proportional to E„, that is, 
W„ — (1 /s„)E n tr(WE n ) where s„ is the dimension of this is indeed the case 
in XI (2.1) in which 


p P £ n 

W — f _ 

" "tr (e-wy 


(1.3) 


where H is the Hamiltonian operator describing the relative motion with 
respect to the nucleus. Equation (1.3) describes the case of thermal radiation. 
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Except for the factor tr (WE„) which depends upon W and represents the 
probability of E„ in the ensemble, the intensity of the spectral line co„ m is 
essentially determined by the expression (1.2) where we replace W by (1 /s„)E n . 
We obtain 

4 co 3 -* -* 

At-^tt(E„d(0)-E m d(0)E n ). (1.4) 

3 s„ 

Let the three spatial components of d be denoted by d v ; let us introduce a 
complete orthogonal basis v m for the subspace determined by E„. We may 
then rewrite (1.4) as follows: 

4oi 3 s ” Sm 3 

n (i-5) 

n a=l /? = 1 v= 1 

where we have replaced d v (0 ) by d v . 

In order to make assertions concerning the intensity of spectral lines it is 
therefore necessary to evaluate the transition matrix elements < v na , d v v mll }. 

In (1.5) we have considered the dipole radiation contribution. This is a 
good approximation provided (see XI, §1) that the wavelength A of the 
radiation is large compared with the diameter d of the atom. According to 
XI (3.6) an estimate of d is given by r 0 . It remains to show that, for atoms, A is 
in fact substantially greater than the diameter d (A is approximately a 
thousand times greater than d). 

Higher order effects, such as magnetic dipole radiation (see XI, §1) or 
quadrupole radiation will only be noticable if all the (v„ x , d v v m/j ) are identi¬ 
cally zero for the transition n -+m. Here we note that it is clear that in the case 
of radiation associated with other moments that we would have to replace 
(v m ,d v v mP } by other matrix elements (v„ a , Av mj ,') where A is the 
corresponding operator. 


2 Representation Theory and Matrix Elements 

In a Hilbert space let {(f>J be a set of vectors (not necessarily linearly 
independent) which span a subspace of Jf. Let be a completely reducible 
representation space for a set of operators (see AIV, §14) 

= *] ® ^ (2-i) 

where the i, v are irreducible. Let the subspace -f be invariant under the 
operators of the representation. If <p v is a complete normed orthogonal 
system which is compatible with the decomposition (2.1) then the f/> M may 
be expressed in terms of the <p v as follows: 

</V = (2.2) 

V 

A homomorphic map of into 4, is defined by -> Jjv where the 
summation may only take place over the <p v in i l . Thus the irreducible space 





2 Representation Theory and Matrix Elements 119 


i l must be isomorphic to an irreducible component of the space 9~, or, if .T 
has no irreducible component isomorphic to , all the a Vfi (where the index 
v is one of the <p v from i x ) must be equal to zero (see AIV, §14). 

Let Jr be another representation space which can be completely reduced 
as follows: 

Sc = ?! © i 2 0 • • • • (2.3) 

Let xj/p be a set of vectors (which are not necessarily linearly independent!) in 
if which transforms under the representation with the same matrix as the 
vectors that is, if U is a transformation in VC of the representation operator 
and V is the operator for the same transformation, we therefore find that 

= £ <f> v p vll and Vip^ = £ \// v p vll . (2.4) 

V V 

Let y v be a complete normalized system of vectors which is compatible with 
the decomposition (2.3). Then we may express the as follows: 

(2.5) 

V 

Let and ip be a pair of isomorphic representation spaces. Let 
<P«„ <p ai ,..., q> Xv denote the cp v from and let %p t , Ip 2 , ■ ■ ■, Xp v denote the y v 
from ip where cp Xt and y Pt are a pair of isomorphic basis for i a and Ip, respec¬ 
tively. 

We shall now construct a vector space if from a linearly independent 
basis Vp. The number of the may be equal to the number of the </>„ and of 
the i /^. if may be transformed into a representation space by means of the 
transformation v „ -> Y,v r„p VM where the p vfl are defined according to (2.4). 
A homomorphic mapping of if onto 3T is defined by v 4 -* </>„. If if has a 
decomposition of the form if = Vy + %, + • • ■ into irreducible components, 
then must be isomorphic to the sum if Xl + if Xl + • • ■ of a part of the 
We note that CT need not contain all the "V v because we did not 
assume that the (p „ are linearly independent. 

We will now assume that every irreducible component in if occurs exactly 
once. If we then construct a vector space iV which is isomorphic to t x and 
ip using a basis w t for IV which is isomorphic to the <p^ and y fir then the maps 
Vp -> w z a* ttl and -> £ r w z b flz ^ are homomorphic maps of if onto iV. 
Since if contains only different irreducible components all V v except at 
most one must be mapped onto the null space (see AIV, §14). If V n is iso¬ 
morphic to iV then it can be mapped onto the space iV. The map of V n onto 
IV is uniquely determined up to a numerical factor (see AIV, §14). Therefore 
we obtain 


( 2 . 6 ) 

where the ratio X x :X 2 depends only on i x and Ip and not upon t and p. 

The relations derived above hold for the special case in which VC — Jfr 
and if <p v = y v and i x = ip; here the (p^ and ip lz may, of course, be different. 
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Let A i be a series of self-adjoint operators such that, for the operator U, 
the following representation 

UA t U + =Y J A k d ki . (2.7) 

k 

holds. The representation 9> A is given by the matrix d ki . If q> u q> 2 , •.., <p n are 
the basis vectors of then the <l> fl = 4> ik = A t <p k (k = 1, 2,..., n) and 
H = i,k span a representation space .T and therefore contains a part of the 
irreducible components of the product representation 9) A x 9> n . From 

<fiik = A i (p k = Y, <Pv<<Pv, A i<Pk > (2-8) 

V 

we find that all matrix elements <<p v , A t (p k y for which v is the index for the 
vectors q> v in t x where i rj is not isomorphic to an irreducible representation in 
9> a x must vanish. If the representation 9> A x contains only 
different irreducible components then it is possible to determine the matrix 
elements (<p v , A t (p k ) up to numerical factors by means of group theory 
alone, by investigating the homomorphic maps of a space "V for the product 
representation 2> A . x 9 tt onto v 

3 Selection Rules for One-Electron Spectra 

In addition to the previously discussed representation of the proper rotation 
group £& 9 (the rotations with determinant +1, which are continuously 
connected to the unit element), the representation of all rotations (including 
the reflection r:x' v = —x v which was introduced in VII, §7) is of great 
importance. By analogy with the description of 9)$ as the group of rotations 
“about the nucleus” we will consider r to be a “reflection about the nucleus.” 

According to VII (7.1) we may represent r by means of a unitary operator 
R where R is defined in the momentum representation of by the equation 

R{k u k 2 ,k 3 \(py = <-&!, —k 2 , - k 3 \(p} 

and for the position representation by the equation 

R(x 1 ,x 2 ,x 3 \q>} = <-*!, -x 2 , -x 3 \<py. 

In ,3f h the (21 + l)-dimensional subspaces t vl with basis vectors (p v (r)Y l m (0, (p) 
(m = l, l — 1 ,..., — 0 are also invariant under I?; we obtain 

R(pv(r)Y l m (6, 9) = (-DVv(r)Y 1(6, cp). 

This follows from the fact that the Y l m are homogeneous polynomials of 
degree /in the components e k = sin 6 cos cp, e 2 = sin 0 sin <p, e 3 = cos Q of the 
unit vector. 

For a fixed nucleus (that is, for electron coordinates relative to the nucleus) 
the Hamiltonian operator is not only invariant under rotations but also under 
reflections, that is, (R x 1) H = H(R x 1) also holds for n electrons, where 
R acts upon the orbit space and 1 acts in the spin space of n electrons. Later 
we will make use of this fact. 
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If we ignore the spin for a one-electron system, then the eigenspace t„, 
corresponding to the eigenvalue e nl belongs to the representation of the 
rotation group and to the “reflection character” ( — 1)'. The dipole moment 
of the electron is d = ef because r is the position vector from the nucleus to 
the electron. The three components of r are the position operators Q 1 , Q 2 , 63 . 
Instead of these components we shall use linear combinations of the Q t which, 
under rotations, are transformed in the same way as the spherical harmonics 
yj, Y l o, Y{. 

It is easy to verify that 


Ri — — 7 = (2i + 1Q2), 
^0 = ~Q3> 

R-i = —7= (-61 + 1Q2) 

V2 


(3.1) 


satisfy these conditions, because, according to VII, §3 

Y\ = 


3 1 . 

— -e up sin 9, 
2 n 2 


Y 


1 _ 
0 — 



e, 


TL 


3 1 


2n 2 


e~ i<p sin 6. 


The therefore transform according to the representation 3 >,. The reflection 
character of the components R l; R 0 , , is (-1). The only transitions 

£„, -*• e„. r which can occur are those for which the reflection character of e nl 
multiplied by the reflection character of d u d 2 , d 3 is equal to reflection 
character of the energy level e„ T and where %) v occurs in the product repre¬ 
sentation of x @> l = 3> l+1 + 3>i + Therefore we find that 

(- l) r = (-1) !+1 and /' = / + 1 or / or / - 1. The value /' = l clearly does 
not satisfy the first condition; we therefore obtain the selection rule /-*■/+ 1. 
S terms can be combined only with P terms, P terms only with either S or D 
terms, etc. In the energy level diagram (Figure 11) the slanted lines indicate 
the possible transitions and their corresponding wavelengths. 

It is easy to show that, for quadrupole radiation, the selection rules are 
Z -* / + 2, l, and l — 2. 

If we take spin into consideration we find that the eigenfunctions of a term 
£„ Tj -. which arises continuously from e„, upon “switching on” the spin belong 
to the representation of the rotation group and to the reflection character 
(-1)( Dipole radiation transitions are possible only for energy levels e„ T/ 
for which the reflection character (-1) 1 ' is equal to (—1) !+1 and the angular 
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momentum quantum number / is equal to j + 1, j or j — 1. The above 
selection rules for l need not apply rigorously because the eigenvectors 
corresponding to E nij under the operator V D in need not transform precisely 
according to 2>,. However, since j = Z ± j, it follows from — 1 + 1 

that /' = / + 2, l + 1, l — 1, l — 2 are possible. From ( —1) ! = ( —1) !+1 we 
exclude the values /' = / — 2, /, / + 2; we therefore find that l-* l ± 1 is 
correct. 

Using the symmetry group of rotations and reflections it is not difficult 
to show that the exact eigenfunctions have the same form as the correct 
linear combinations of the zeroth approximation (see XI, §§11.1 and 11.2), 
that is, 


'I'mjir) 


and 




jl + m + 1 
21 + 1 


Y l m u+ + 


11 — m 
'll + 1 


rl , 

m+ 1 1 


jl — m 
21 + 1 


Y> + + 


jl + m + 1 
21+1 


for j = / + i 


for; = / - i 


where, in the zeroth approximation ^„ i(+1/2 = (p„i = ^„u- 1 / 2 • The eigen¬ 
values e nlj therefore correspond exactly to the representation with respect 
to the operator F D which acts in 

The selection rules determine the qualitative structure of a multiplet 
spectral line. A transition from an S term (that is, l = 0) can only occur to a 
P term (f = 1). The sodium D doublet, a pair of yellow spectral lines, may be 
explained in this way. The P term consists of two levels with j = | and j = \\ 
for the S term j = Both transitions ; = -§-> 5 and j = \ -> j = j are per¬ 
mitted. If we consider, for another example, a transition between a D term 
and a P term in which both terms are split, we may then expect four closely 
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Figure 14 Fine structure of a line D -> P. 
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spaced spectral lines. According to the selection rule for j, the transition 
j = 2 j = j does not occur. We therefore only obtain three spectral lines. 
For the ratios of the intensity of these lines we may make assertions of a pure 
group theoretical type providing that we assume that the exact eigenfunctions 
differ only slightly from the correct linear combinations of in the zeroth 
approximation (XI, §§11.1 and 11.20). This question will be treated in detail 
in XIV, §5. In Figure 14 the line thickness provides a crude estimate of the 
intensity for the transition. 


4 Selection Rules for the Helium Spectrum 

For the case of dipole radiation a helium atom term with total angular 
momentum quantum number J can only be combined with a term J' = J + 1, 
J or J — 1. The proof is obtained from x 0, = @ J+1 + 2 j + @>j _, 
directly from the above discussion and from the fact that the eigenspace of a 
term with quantum number J the representation of the rotation group is 

For the accuracy with which the exact eigenfunctions may be described 
by the “correct linear combinations” of zeroth approximations with respect 
to the spin as a perturbation, we may deduce the following additional 
selection rules: The eigenfunctions of an S, P, D term correspond to the 
representations @ 0 , S> 2 , ■ ■ ■ with respect to the operators V D which act 

in 3%. Since x Q> 1 = @> L+l + $> L + @ L - U it follows from the above 
considerations that the only possible transitions for dipole radiation are 
L-+L-1,L,L+1 . 

The reflection symmetry of a term consisting of a Is electron and a second 
electron with orbital angular momentum / is obviously (—1)'. Therefore we 
must exclude the transition L -> L. 

Since the operator for the dipole moment d = e(r t + r 2 ) (and similarly 
for the quadrupole moment) is transformed into himself by permutation of 
the electrons, it transforms a vector of the antisymmetric or symmetric 
representation of the permutation group (with respect to the permutation 
of position coordinates alone) into one of the same type, so that transitions 
are possible only between the symmetric terms (and correspondingly between 
the antisymmetric terms). An example of such a term system is given by 
ortho- and para-helium (see Figure 20). The transitions indicated in Figure 20 
illustrate the selection rules for L and for the permutation symmetry. The 
lowest level of the 3 S terms is therefore said to be metastable. A transition 
Is 2s 3 S -> (Is) 2 ‘S (not necessarily by means of dipole radiation) is not 
absolutely impossible because the exact eigenfunction of Is 2s 3 S is not a 
product of an antisymmetric position function and a symmetric spin function 
as is the case for the correct linear combinations of zeroth order. Such a 
transition is, however, seldom seen in the laboratory because an atom in the 
state Is 2s 3 S in a gas will collide with other atoms (because the ideal gas 
considered in XI, §2 is not really ideal!) and must undergo a transition to the 
state (Is) 2 ‘S before it can emit a photon. 





CHAPTER XIV 


Spectra of Many-Electron Systems 


1 Energy Terms in the Absence of Spin 


The Hamiltonian operator for / electrons 

i— 1 i= 1 ' 


+ I 


<k 


is invariant under the symmetry group of rotations 2><$, the group 9t of 
reflections consisting of the elements e and r and the permutation group 
S f , that is, under the group S s x f x S f . The eigenspace of H 0 will 
therefore be the representation spaces of 3) 9 x 91 x S f . Except for 
“accidental” degeneracies such an eigenspace of H will be irreducible with 
respect to the group x 9t x S f . 

If t is such an eigenspace, then it can be reduced with respect to the repre¬ 
sentation of as follows: 

* = ti © *2 © ' ’' • (IT) 

The representation operators for the group Sy in Jf { commute with the 
representation operators for the group S> 9 . For this reason (see AIV, §14) 
we may choose the basis vectors in in such a way that we obtain a matrix of 
the form 


h 

h 

'I'Ll 

*L- 1 

' Pl-11 ■ 

l-L 

■ ■ 'I'-Ll 

@L 


i 2 

< I'Ll 

'I'l- 12 • 

’ ’ *A - L2 


(1.2a) 

*r 

'I'L 

'I'L-lr ' 

' ■ 'I'-Lr 




A 

A 

•• A 
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where, for fixed n, the i/^ generate the irreducible representation 3 L of 3 9 , 
and for fixed M, for each M generates the same irreducible representation A 
of Sj. The above matrix therefore also spans an irreducible subspace 0~ of t 
with respect to the group 3 9 x S f . Since r commutes with respect to the 
elements of 3 9 x S f , we may choose 0~ such that the reflection operator 
results in the multiplication of the vectors by either 1 or — 1. Since we assumed 
that t is irreducible with respect to 3 9 x S f x 01, we therefore find that 
S' = *. 

Thus the space 0~ of the basis vectors (1.2a) can, in a formal sense, be 
written as a product space (see AIV, §14) as follows: 

^ = ^xJ 4 , (1.2b) 

where the representation operators of <3% have the form U x 1 and those of 
S f have the form 1 x V and, in addition, 0~ L and ,0^ are irreducible with respect 
to the rotation group and the permutation group, respectively. We will denote 
the basis vectors of 0[ by (M = — L,..., L), and the basis vectors of 0^ by 
$ v (v = 1 ,..., r ); we then obtain 

</4v = $m<£v (1.2c) 

Each energy eigenvalue of H 0 in M'{ can therefore be characterized by an 
orbital angular momentum L corresponding to the representation 3 L and by 
a representation A of S f . 

For the case of helium with two electrons L takes the values 0,1,2,... where, 
for each value of L with the exception of L — 0, both possible representations 
A of S 2 —the symmetric and antisymmetric—occur. Since the Pauli exclusion 
principle must be satisfied, symmetric eigenvectors of must be combined 
with the antisymmetric vectors of and vice versa. The antisymmetric 
vectors in t 2 form, according to XIII, §4, an irreducible representation space 
for the rotation group with the representation 3 0 ; the symmetric vectors 
form a corresponding irreducible representation space with representation 
Thus the symmetric eigenvectors in uniquely correspond to the total 
spin angular momentum quantum number S = 0, the antisymmetric with 
S = 1. Similarly, for the case of/electrons we may make the following claim: 

In t{ we may reduce the group 3$ x S /; that is, we can choose the basis 
vectors in i{ such that they can be ordered as a matrix as follows: 


r s 

fs-i • 

•• r_ s 


1 

1 

1 


nfi 

u I-ii 

■ • M -S1 

3 , 

«S2 

U S-1,2 ' 

■ ■ W -S2 


u s st 

u S-lt 

' ' M~St 


A' 

A' 

•• A' 



where each row of the matrix generates a representation space of 0) 9 for 
the representation 3 h and each column, in the same way, transforms 
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i 

according to a representation A' of the symmetric group S f . The matrix 
therefore generates a subspace of t{ which is irreducible with respect to 
the group x S f . The reflection r need not be described because, according 
to VII, §7, r behaves like the unit operator in t{. The representation of 2% in 
i{ is given by 2 1/2 x 2 1/2 x ••• x 2 l/2 = (2 , i2 ) f - The representations 
2 S are therefore obtained by reduction of this product representation. Since 
2 S x 2 r = 2j+y + 2 j+j ,_ 1 + • • • + it follows that 

2 1/2 X 2 m = 2, + 2 0 ; 

2 1/2 x 2 1/2 x 2 l/2 = 2 2 / 2 + 2 1/2 + 2 l/2 = 2 3 / 2 + 22 l / 1 ', 

2 1/2 X 2 1/2 X 2 l/2 X 2 ll2 = {2y/ 2 f = 2 2 + 3 2 l + 2 2 0 ; 

etc. Therefore the possible S values for / electrons are S = f/2, f/2 - 1,... j 
or 0. 

In §7 we shall prove that for a particular value of S there is only one sub¬ 
space iV\' of and that different S correspond to different A'. Therefore we 
obtain: 

f/2 

4= £ ® iri- (i.3b) 

S= 1/2,0 

According to AIV, §14, the subspace iV\ with basis vectors (1.3a) can be 
written as a product space 

= % x (1.3c) 

where we find that the rotation group in iV s and the permutation group in 
have irreducible representations. Let the basis vectors of be denoted 
by u s M (M = —S, ..., S), and those of 'W A , be denoted by u p (ji = 1,..., f); 
then we obtain 

«mv = (l-3d) 

We shall now seek the antisymmetric subspace _ of where = 
x or, what amounts to the same thing, to select the vectors in all the 
x iV\. for the antisymmetric representation of S f in r . If we use the 
form (1.2b) and (1.3c) for ST and then we obtain x = {2 L x 

if s ) x (21 x iV'tJ). Therefore we need to find those vectors in x 
which transform according to the antisymmetric representation of S f . The 
representation of S f in x is A x A\ When and how often does A x A' 
contain the antisymmetric representation? To provide an answer to this 
question we shall consider the following more general question for an 
arbitrary group: How often does the representation A p x A„ (the product of 
two irreducible representations) contain an irreducible representation A v ? 
The character x of A p x A a clearly satisfies y(a) — y p (a)yja). According to 
AV, §8 we find that 


/Ja)/Ja) = £ c v p<r yja), (1.4) 

V 
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where the c] )a are integers and specify the desired multiplicity of the repre¬ 
sentation A v in A p x A ff . For unitary representations, from the orthogonality 
relations AV, §8 we therefore obtain: 

= \ £ xJa)7. p (a)/Ja). ( 1 . 5 ) 

If A v is the antisymmetric representation A_ of S f , we then have 
X~(P) = ( — l) p , and we therefore obtain 

) P X P (P)X<r(P)- (1-6) 

Yl I p 

To each representation A„ of S f there exists an “associated” representation 
A „., the matrix of which is (— l) p times the complex conjugate matrix of A,. 
For A_ as the antisymmetric representation and A a as the conjugate complex 
representation of A„ we obtain A ff - = A„ x A_. Therefore we may rewrite 
(1.6) as follows: 

i I x p (P)xAP)- (1.7) 

fl i p 

According to the orthogonality relations the right-hand side of the above 
equation is either 0 or 1 according to whether a' =£ p or a' = p. 

Therefore we find that .T A x W A . contains the antisymmetric representa¬ 
tion either once or not at all depending whether A' is or is not the associated 
representation for A. Since A' is uniquely associated with the value of S, this 
is also the case for the representation A which corresponds to an eigenvalue of 
H 0 permitted by the Pauli exclusion principle. All such ST A and corresponding 
terms having a representation A which do not have associated representations 
in i{ do not occur in nature. For fixed electron number / the permitted 
representations A will be distinguished by the index S. The only vector from 
^x^ which transforms according to the antisymmetric representation 
of S f is easy to determine. Since A' is the associated representation for A, we 
find that r = t. For the normalized vector. 

$ = -4(1.8) 

V r k 

we obtain 

P$ = -JfE (PfoXPfi*) 

V r k 

= JL rltiP lk u n P nk (~ D P , (1.9) 

y/r kin 
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where P lk is the matrix of the representation A and P„ k ( — 1 ) p is the matrix 
corresponding to the representation A'. Since the representation of S f is 
unitary, we obtain P, k P nk = d ln , and from (1.9) we therefore obtain 

P$ = (- l) p $. (1.10) 

$ is the desired vector from .% x Since x iV'\, — (2T L x iV s ) x 
{.T A x iV A ), the antisymmetric subspace {ST x 1V'\^~oigr x iV\. is given by 

05 X x {$}, (1.11) 

where {$} is the one-dimensional subspace of.^ x spanned by <fi. If we 
introduce the identification (1.2c), (1.3d), then, for the basis vectors of 
{ST x iP'%} we may use the 

<t>mun s = “7= £ ^£ijk«4 s *» (1.12) 

v r k 

where m L = — L.L and m s = From (111) we may write the 

basis vectors as follows 

$mA<l> (1-13) 

from which we may easily recognize their transformation properties under 
rotations and permutations. {ST x 1V S A ,} is the eigenspace corresponding 
to an eigenvalue of the energy operator H 0 where the influence of the spin 
upon the energy has been neglected. Such a term is usually denoted by 2S+1 X 
where X = S, P, D,... depending on whether L = 0, 1, 2,..., 2S + 1 and is 
called the multiplicity of the term; it also characterizes the total spin and 
the representation A of the permutation group. 


2 Fine Structure Splitting of Spectral Lines 

The total energy operator must have the form H = H 0 + H' where H' 
describes the effect of spin upon the energy. We again consider the operator 
H{X) — H 0 + XH', where we permit 1 to increase from 0 to 1. The eigenvalues 
of H 0 , called parent terms, are degenerate of order (2 L + 1) • (2 S + 1) and 
correspond to the eigenvectors (/> mrms - The representation of the rotation 
group in this eigenspace is SX L x 2 S , a result which directly follows from 
(1.13). Since XX L x SX s = S> L+s + £X > L+S -! + ••• + @\ L -s\> the parent terms 
will be split into several components corresponding to the total angular 
momentum quantum numbers J = L+S, L+S — 1,..., \L — S| (that is, 
for L> S, into 2S + 1 components) because H commutes with the rotation 
operators U D = V D x R D (where V D and R D act in :#{ and r{, respectively). 
We shall denote the individual components of the energy level 2S+1 X by 
the symbol 2S+1 Xj. 
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It is easy to determine the correct linear combinations for a perturbation 
computation. Since the transform like products \j/ mL U ms , from 

XI (10.14) we obtain: 

n, = I 4U s (L,S)</> mLms . (2.1) 

rtiL + ms-M 

According to VIII (5.8) the operator H has the form: H = H 0 + H' where 
H' = Y,t= i H k providing we exclude external fields. Since H u H 2 , H 6 
commute with V D , they may only result in a displacement of the energy 
levels from those of H 0 . Only H 3 , H A and H s contribute to splitting. From 
experience we find that H A can, for the most part, be neglected in comparison 
to H 3 and H 5 . Therefore we find that the only component of H' which 
significantly affects the splitting is an expression of the form H" = , H u 

where the H il may easily be written as diagonal elements of a tensor H ik . Here 
H lk is a tensor in the sense that, for fixed k, the H ik behave, under V D as a vector; 
similarly, for fixed i, they behave like a vector under R D . Thus, in the subspace 
spanned by <f> mLms (or F^) we obtain 

H ‘$ m]L m s muns > ( 2 . 2 ) 

because the matrix elements of the operators H lk are, up to a factor, uniquely 
determined by the transformation properties of the H ik . We shall now prove 
the above claim. 

Under transformations of the form V D the H ik transform according to 3> y , 
and under transformations of the form R D according to 92 x . In addition to 
the rotation group 2d<g let us consider the group 92 x , that is, the outer 
product of the rotation group with itself. Its elements are clearly all pairs 
D t x D k of two rotations £>, and D k (note that D t x D k is not the usual product 
of two rotations!). We have often investigated the representations of a group 
2/ x x 2 2 (see, for example, 92, s x S f in § 1 ). If the irreducible representations 
of and 2 2 are already known, then we know the irreducible representa¬ 
tions of 2 X x <g 2 . For example, if t x is an irreducible representation space 
of 2 x and i 2 of 2 2 then i x x t 2 is an irreducible representation space for 
2 x x 2 2 \ conversely, if i is an irreducible representation space of 2 x x 2 2 
then, according to theore ms in AIV, § 14 , we find that $. can be written in the 
form x» 2 (as we have often used earlier) where the representation 
operators of ^ l5 2 2 have the form U x 1 and 1 x V, respectively. If A is the 
representation of 2 X in i x and A' is the representation of 2 2 in *2 then we 
denote the representation of 2 x x 2 2 in x t 2 by A # A'. The irreducible 
representations of the group 2 9 x <2> 9 are therefore the 3>j * 92 y . The H ik 
are therefore subject to the representation 92 x * 3) x \ the </> mt ms are subject 
to the irreducible representation 92 L * 92 s (see, for example, (1.13)). 

Since {92 x 2 ® x ) x {92 L * 2> s ) = Xs' + =s- 1 1 * @s', the ex¬ 

pansion coefficients of the H ik (f> mLms with respect to the <t> mLms are identical 
to those of the LiS k <f) mLms up to a factor (see XIII, §2). 

If, in the eigenspace we select the basis T^ (defined in (2.1)) instead of 
the basis then the matrix of £? =1 H u is diagonal and equal to the 
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t multiple of YJ= 1 L t S t = L S. Since 2L - S = J — L 2 — S 2 the eigenvalues 
of H are split, in a first approximation, by a magnitude 

AEj = T - [J(J + 1) - L(L + 1) - S(S + 1)]. (2.3) 

The energy difference between two successive energy levels is given by 

A£/+i — AEj = r(J + 1). (2.4) 

If the factor t is positive, we speak of a normal, otherwise of a reversed 
multiplet. 

Here we note that Lande’s interval rule, given by (2.3), holds, of course, 
only for multiplet splitting which is small compared to the separation 
between the parent terms. For some atoms such is not the case (see §3). 


3 Structure Principles 


In order to obtain an overview of the different energy levels of a complicated 
atom it is useful to think of the eigenvalues of the operator 


H 0 = 



v Ze 2 „ e 2 

£—— + I - 

i r i i<k r ik 


as continuously arising from an operator 


where V(r t ) takes into consideration the mutual interaction of the electrons 
in addition to — Zc 2 /r ; . If we then think of “switching on” in a continuous 
manner from H 00 to the complete mutual interaction of the electrons, then 
we may easily determine which energy levels of H 0 result from the energy 
levels of H 00 . The energy values of H 00 are equal to e„ lh + e„ 2h + • • • + e„ flf 
where e nl is an eigenvalue of the operator jP 2 + V(r) corresponding to the 
principle quantum number n and the angular momentum quantum number /. 
This situation is qualitatively analogous to that described in XI, §8. Let t nl 
denote the eigenspace corresponding to the eigenvalue e nl in Then the 
eigenspace ZT corresponding to the eigenvalue e„ lh + ■ ■ ■ + e„ flf of H 00 in 
will be spanned by t nih x t„ 2h x • • • x t„ flf and by all the other spaces 
obtained from the latter by the application of the permutation operators. 
L et Inim (where m = —l, -l + 1,..., 0 denote the basis vectors of then 
the eigenspace for the above eigenvalue is spanned by the set of all 
Pin, ;,m,(l), Zmi 2 m 2 (2), • • •, Xn f i f m f (f) where P denotes all elements of S f and 
m all values of — / to l. 

The eigenvalues £„ lh + ■ ■ ■ + e nflf are called the “electron configuration,” 
and isoften denoted by ( n 1 l l )(n 2 l2 ) • • • ( n f l f ). For the case in which some of 
the («j / ; ) are identical, we often abbreviate the above expression as follows: 
(n l l l y'(n 2 1 2V 2 ■ ■ ■ {n t l T f x where a v is the number of the same (n v / v ), that is, 
the number of “equivalent” electrons. 
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The reflection character of an electron configuration is clearly (— l) 1 ' 1 '. 
Therefore all energy levels of H arising continuously from this electron 
configuration as the mutual electron interaction is “turned on” must have the 
same reflection character ( — l) s,ii . 

In order to determine the energy levels of H 0 into which a given electron 
configuration is split by the interaction of the electrons we must reduce the 
eigenspace corresponding to the eigenvalue s„ lh + • • • + s nfl/ of H 00 
according to the group x S f , that is, we must seek basis vectors <pm L k 
which may be arranged in a matrix of the form (1.2) where the rows and 
columns transform according to irreducible representations Q L of and A 
of S f , respectively. Each such matrix will then give an irreducible representa¬ 
tion of x Sj. By switching on the interaction the basis vectors may be 
changed, that is, -* 'P l M L k ■ Nevertheless, the representations $> L and A 
are not changed. Therefore the type of terms of H 0 with respect to their 
representation in the form (1.2a) is exactly known from the electron con¬ 
figuration. A detailed solution of this problem in the manner described above 
requires detailed knowledge of the representations of the group Sy and will 
be carried out in §7. Since each of these matrices (1.2a) (where ip is replaced 
by ip) can only be combined with a particular value of the spin quantum 
number S, we may proceed as follows: Instead of determining the \j/M^ k and 
later establishing the connection with the L r * s k by means of (1.12), we 
immediately seek the 


1 


l^kUlsk 

r k 


which lie in {Jf ■ f } _. To do this we proceed by making use of the set of eigen¬ 
vectors of H 00 in { f } _ = {( Jf b x t s ) f } _ . In the eigenspace of the 
eigenvalues e nih + • • • + e nfif is spanned by the vectors 

^ > XnihmX^')^niiX)Xn2hfn2^)^'ii2^'') * ' * Xnflfmf(f')U[if(f ')J (3-1) 

where m t takes on values —l ( to Z ; , /q takes on values —j and \ and P runs 
through all permutations of S f . The basis vectors for the subspace 
corresponding to are obtained from the Slater determinant XIII (4.8) 

as follows: 


r nilimun ,n2hm2ti2, ••• ,nflfmffif 

= - 4 r, £ (- i) p n ni „ mi (iK,(i) • • • Xn f , fmf (fK f (n (3.2) 

v/ ! p 

Here no pairs (n^mi/T;) may be identical, otherwise the right-hand side of 
(3.2) would be zero. Thus interchanging the order of the (n^wi^!), 

(n 2 l 2 m 2 n 2 ), ( n f l f m f n f ) as indices for i p nihmwt . „ } i fmm does not lead 

to a new vector; we may therefore impose an order, for example, lexiographic 
order, as follows: n,/ £ mf/z £ may be on the left of n k l k m k n k only if n, > n k or if 
ni = n k then/ ; > Z t orifn ; = n k ,Z ; = l k ,mi > m^orifn; = n k ,l t = 4,m ; = m k . 
Hi > Hk- 
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We now again consider the transformations V Dl x R D2 (where D t is 
independent of D 2 ) as a representation of 2t v x 2$. The desired 
generates an irreducible representation 2 L * 2 S with respect to 2 9 x 2 9 . 
Since V D commutes with R D , the reduction of JT can again be expressed in 
the form of a matrix, where the rows and columns transform according to 
representations 2 L and 2 S with respect to V D and U D , respectively. The 
matrices obtained in this way yields the desired <t> mLms - The fact that we may 
use the entire group 2 9 x 2 9 instead of the group 2 9 rests upon the fact 
that H 0 commutes with all V Dl x R D2 . Including the influence of spin we find 
that the entire energy operator H commutes only with U D = V D x R D . With 
respect to U D the matrix of the <f> m , ms satisfies the representation 2 L x 2 S = 
2 L , s + • • • + @\l-s\ where the latter corresponds to a multiplet splitting. 

With the help of the characters it is not difficult to determine the irreducible 
representations 2 L * 2 s in _ and their multiplicity with which they occur. 
Since all the transformations V D (as R D ) are equivalent to diagonal trans¬ 
formations we need only consider rotations D about the third axis to find the 
characters: 


^D^nihmuii . 

_ p i(mi + m 2 + ... +m f )<xJ / 

..nflfmfUf c Ymlimim ...rtflfTtifUf 

(3.3) 

and 




— p i(n 1 + H2+ ... +Hf)P t l, 

...nflfttiffif c V 7 nilirmm ...rifl/m/fif 

(3.4) 

The character of the representation of 2 9 x 2<g in is therefore 




(3.5) 


nihmaxi 


where the summation is over permitted values of The character of 

2 L » 2 S is given by 


L S 


I I 

m_L = ~ L ms= ~S 


e i(am L + Pms) 


(3.6) 


We will now show how we may decompose the character (3.5) according to 
the characters (3.6). We begin by writing down all possible symbol sequences 
in lexiographic order (where n,/ ; is fixed, m ; = £): 


(n l l l m 1 fi l )(n 2 l 2 m 2 fi 2 ) •' • ( n f l f m f n f ) (3.7) 

(here two identical symbol sets may not occur). Each such symbol set 
characterizes a basis vector for 2T _. After each symbol set we may write 
m t and Hi- This pair of values ]T ; m,, £7 n t can be arranged in a table 
of the form 



Piifie. 7lltt£fccj<t giumf Pfcy iioi 



3 Structure Principles 133 


marked by x’s. For each desired representation @> L * @> s at each of the 
locations m L , m s for m L = —L, — L + 1 L, m s = —S, —S + l,...,S 
an x has to be deleted (that is, in total (2L + 1 )(2S + 1) x’s). We begin with 
the largest possible value of m L , and for this value of m L , the largest possible 
value of m s . 


Example. 

Three equivalent p electrons (that is, l 

= 1): 


(n 

1 1 

2) (n 

1 1 

~b 

(n 1 

0 

b E w i = 2 

i 

i 

( 

1 

i) ( 

1 

-b 

( 

-1 

b 1 

1 

2 

( 

1 

i) ( 

0 

b 

( 

0 - 

-i) 1 

1 

2 

( 

1 

b ( 

0 

b 

( 

-1 

b 0 

3 

2 

( 

1 

b ( 

0 

b 

( 

-1 - 

-b 0 

1 

1 

( 

1 

b ( 

0 

-b 

( 

-1 

b 0 

1 

2 

( 

1 

-b ( 

0 

b 

( 

-1 

b 0 

1 

2 


It is not necessary to include additional symbol sets because they would 


result in negative values for the two sums m t or /r ; and do not permit 
further representations of 3i l * S> s . The table has the form 



L = 2, S = \ are the largest values of m L and for m L = 2 of m s . For each of 
the m L , m s = 2, b 1, 0, ^ one of the x’s has to be deleted. For the rest the 

largest values of m L , m s are given by m L — 1, m s = j, that is, L = 1, S = \ 
and we finally obtain L — 0, S = §. We therefore obtain three matrices 
corresponding to representations S> 2 * and S> 0 * ® 3/2 . 

From the eigenvalue 3s ni of H 00 , that is, from the electron configuration 
(nl) 3 we obtain three energy levels of H 0 — the 2 D, 2 P and 4 S terms. 

In practice three rules greatly simplify the process of finding the energy 
levels of H 0 corresponding to a given electron configuration. We note that 
2 (2 1 + 1) electrons with the same (n, l) values are said to form a full shell 
because it is not possible to have more than 2(21 + 1) electrons with the 
same (n, l) value. Thus we obtain the following rule: Full shells in an electron 
configuration do not increase the term manifold, that is, they may be neglected 
in the determination of the terms. 

The proof of this fact follows from the fact that in each symbol series all 
possible (n, l, m, fx) occur for a full (n, /) shell and therefore do not contribute 
to m, and p ; . 

The second rule states that in each symbol set we can separately treat 
groups of equivalent electrons and, afterwards we can combine such groups 
without taking into account the Pauli exclusion principle. 
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For example, if we are given two groups of equivalent electrons which 
correspond to a set of values m' L ,m' s and m2, m's (arising from m t and Y,i PiX 
then it is easy to see that for both groups taken together the pair m L , m s can 
take on all combinations of values m L = m' L + m'l and m s = m' s + m£. 

Therefore, if L', S' is a term for the first group and L", S" is one for the second 
group, then, for both groups taken together we obtain all possible pairs L, S 
for which L = L + L", L' + L" - 1,..., | L - L"| and S = S' + S", S' + 
S" — 1,..., | S' — S"|. We may therefore arbitrarily combine terms of the 
individual groups. 

The third rule states that h and 2(2/ + 1) — h equivalent ( n , 0 electrons 
yield the same term manifold. This results from the fact that to each symbol 
set (w/m^) (nlm 2 p 2 ) • • • of h electrons we can append a symbol set for the 
remaining 2(2/ + 1) — h equivalent electrons. For the series of 2(2/ + 1) — h 
for m t , Y,i Hi we obtain the negative value of m h for the h electrons, 
from which it follows that for both cases we obtain the same manifold of 
pairs X, ntt and 

On the basis of these three rules it is only necessary to specify the terms for 
equivalent electrons for the most important cases. These are given in Table 1. 


Table 1 Terms of equivalent electrons. 


Electron 

configuration 

Terms 

P 2 

i s i D 3 p 


P 3 

2 P 2 D 


d 2 

‘S l D l G 3 F 


d 3 

2 P 2 D( 2) 2 F 2 G 2 H 4 P 

4 f 

d 4 

‘5(2) l D( 2) 1 F 1 G(2) 7 

3 P(2) 3 D 2 F( 2) 3 G 3 H 5 D 

d 5 _ 

-is 2 P 2 D( 3) 2 F(2) 2 G( 2) 
______ 

2 H 2 I 4 P 4 D 4 F A G 6 S 


The procedure described above for determining the energy level terms from 
the electron configuration by first “switching on” the interactions of the 
electrons and then the effects of spin is called the structure principle, and will 
be discussed in the next section on the so-called periodic system of the 
elements. In addition, some examples for term schemes will be discussed. In 
order to make qualitative statements concerning the location of the terms 
we only need the fact that the terms of higher multiplicity (that is, large spin 
values) which arise from an electron configuration have lower energy. We 
have seen this for the case of two electrons in helium. In some cases the multi- 
plet splitting is not small compared to the separation between energy levels 
(the so-called parent terms) which arises from an electron configuration 
taking only the Coulomb interaction into account. Thus the structure 
principles establish the correct energy manifold but do not necessarily 
establish the correct location qualitatively. If the effect of spin is small we 
shall speak of Russell-Saunders coupling. In this case, as described above, 
the orbital angular momentum of the electrons with the quantum number 
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/j are added to the total orbital angular momentum with the quantum 
number L; similarly, the spins are added to the total spin S. Then the in¬ 
fluence of the spin splits the parent terms in different fine structure terms 
corresponding to different values J of the quantum number for total angular 
momentum. 

In the extreme opposite case the influence of spin is large compared to the 
Coulomb interaction of electrons. This case occurs for large values of 
Z ( H 5 is, according to VIII (5.8), proportional to Z!) and is called jj coupling. 
In order to estimate the location of terms it is advantageous, as in the case of 
a single electron, to examine the effect of “turning on” the spin perturbation. 
The terms e nl will split into components e nlj as described for alkali atoms. Then 
it is necessary to examine the effect of “turning on” the smaller perturbation 
of the Coulomb interaction between the electrons. This may be carried out 
by reducing the representation of the rotation group in the eigenspace 
corresponding to the energy eigenvalue + ■■ ■ + e„ flfjf . The term 
manifold obtained from the electron configuration (n^) ( n 2 l 2 ) ■ ■ - (n f l f ) in 
the manner described above must be the same as that obtained earlier in this 
section. 


4 The Periodic System of the Elements 

In Table 2 the electron configuration of the ground state of atoms and the 
symbol for the ground state are given. For helium the Is shell is filled; thus 
the ground state must be an ‘Sq term. For lithium one electron occupies the 
2s shell. The filled Is shell does not contribute to the term manifold; thus, we 
may treat lithium as a one-electron problem, as we have done in XII, §8. 
Since the Is shell is occupied, the third electron cannot have the configura¬ 
tion Is (see Figure 10). For beryllium the 2s shell is already filled; thus 
the ground state is again an 1 S 0 term. We may expect that the term scheme 
for beryllium will be similar to that of helium, corresponding to the electron 
configurations (ls) 2 (2s)(ns), (ls) 2 (2s)(np), (ls) 2 (2s)(nd),.... With the excep¬ 
tion of the (ls) 2 (2s) 2 term, the terms are singlet and triplet; according to the 
above rule the triplet terms have lower energy than the corresponding singlet 
terms. This is shown in the left side of Figure 15. 

In addition there are terms (see Figure 15) which arise from the con¬ 
figuration (ls) 2 (2p)(np) and similar configurations. In part they lie in the 
continuum, that is, they are not regular discrete energy eigenvalues (and are 
therefore called anomolous terms) because a probability for ionization of 
atoms exists but is very small, and as a result there is at most a radiative 
transition to a lower energy level. 

From boron the 2 p shell will be filled until we reach neon, a noble gas. 
Later we shall discuss the term schemes for the elements carbon and oxygen. 
The further structure of the elements is clearly evident in Table 2. For argon 
the 3 p shell will be fully occupied. For potassium instead of a 3 d electron we 
find a 4s electron which is energetically more favorable. Theoretically this 
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result can be obtained only by numerical calculation from the eigenfunctions 
of the potassium atom. For calcium the 4s shell is fully occupied. For the 
elements numbered from 21 to 28 the 3d shell will become filled. A similar 
situation exists in the case of the rare earth elements (numbered 58-70) where 
the 4/shell is filled. The chemical behavior is determined by the outer electron 
shell; this fact explains the chemical similarity of the rare earth elements. 

For an example of an energy level diagram we will consider the case of 
carbon (Figure 16). The electron configuration for the ground state of carbon 
is (ls) 2 (2s) 2 (2p) 2 , which, according to Table 1 for two equivalent p electrons 
results in three terms 1 S, 1 D, 3 P. The latter is the term of largest multiplicity 
and is therefore the groundstate. For the configuration (ls) 2 (2s) 2 (2p)(np) 
(where n j* 2) all possible combinations of orbital and spin angular 
momentum are permitted, so that for each such configuration there exists 
three singlet and three triplet states. For n > 3 the configurations 
(\s) 2 (2s) 2 (2p)(ns), (]s) 2 (2s) 2 (2p)(nd) give rise to terms, which, if observed, 
are listed in Figure 16. We suppose, however, that the excitation of an electron 
from the filled (2s) shell into the (2 p) shell will give rise to additional discrete 
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terms. In order to determine the terms which arise from the electron con¬ 
figuration (ls) 2 (2s)(2p) 3 we shall first take note of the possible terms for three 
equivalent p electrons from Table 1: 2 P, 2 D, 4 S, that is, we have the following 
L, S pairs: 1, 2, 0, f. Note that one 2s electron gives rise to the term 

2 S, that is, to the pair: 0, From 0, \ and 1, \ we obtain the possible terms 
1, 1; 1, 0; from 0, \ and 2, \ we obtain the terms 2, 1; 2, 0; from 0, \ and 0, 
| we obtain the terms 0, 2 and 0,1. From the configuration (ls) 2 (2s)(2p) 3 we 
obtain the six terms 3 P, l P, 3 D, 1 D, 5 S, 3 S. The experimentally observed values 
are given in Figure 16. 5 S is the lowest energy state for these terms. It plays 
an important role for the chemical properties of carbon. 

If Figure 17 we present the energy level diagram for oxygen atoms. The 
electron configuration for the lowest energy level is (ls) 2 (2s) 2 (2p) 4 and gives 
rise to the same term manifold as does p 2 , that is, the same as we found for the 
carbon atom for the 1 S, 1 D, 3 P terms. Here again the last term is both the 
term of highest multiplicity and the ground state. These three terms in 
Figure 24 are not placed in the same column but are integrated within three 
term groups. These three term groups are easy to understand: Excited states 
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of the oxygen atom give rise to an electron configuration where an electron 
of the 2 p shell is excited to a higher energy level: (ls) 2 (2s) 2 (2p) 3 (ns), 
( ls) 2 (2s) 2 (2p) 3 (np ), (\s) 2 (2s) 2 (2p) 2 (nd), etc. where n > 2. In order to deter¬ 
mine terms for such a configuration it is first necessary to obtain the terms 
for three equivalent p electrons according to the table: 2 P, 2 D, 4 S. These also 
represent the lowest states for the ogygen ion. 2 P is the highest and A S is the 
lowest of these three terms. From the 4 S term (the left term group in Figure 
17) the (ns) electron gives rise to an 5 S and an 3 S term; from a (np) electron 
we obtain a S P and a 3 P term, etc. From the 2 D term of the ion the (ns) 
electron gives rise to a 3 D and l D term. The (np) electron gives rise to the 
following six terms: 3 P, 1 P, 3 D, 1 D, 3 F, 1 F. Finally, from the 2 P term we obtain 
the group of terms on the right in Figure 17. 

In Figure 18 we show the term scheme of mercury, the terms of which are, 
according to electron configuration given in Table 2, similar to that of helium, 
which is indeed the case. Here it is evident that the multiplet splitting is so 
large that it stands out in the energy level diagram, especially for the lowest 
3 P term. 


5 Selection and Intensity Rules 


From the atom’s dipole moment d = e Y£= i A we may obtain the representa¬ 
tion of the rotation groupJf we introduce the following linear com¬ 
binations of the components of d 


Ri — ^7= (d\ + id 2 ), 

Ro = ~d 3 , 


(5.1) 


R-i 


l 

75 


(~di + id 2 ) 


as already described in XIII, §3. Using this result, in XIII, §4 we have obtained 
the selection rule J -> J — 1, J, J + 1 for the total angular momentum 
quantum number and shown that only terms having different reflection 
symmetry properties, that is, only energy levels for which the corresponding 
values of YJ= 1 li which differ by an odd number can be combined to produce 
dipole radiation. If the parity of an energy level is ( — 1) then we append, to 
the term symbol a small “o” (see Figures 15 and 17). 

If the multiplet splitting is small, so that for all practical purposes the 
eigenvectors coincide with linear combinations of zeroth approximation 
(1.8) then it follows that (in the same manner as described in XIII, §4) the 
selection rule L-> L — 1, L, L + 1 applies, and that only terms of the same 
multiplicity 2S + 1 can be combined. 

If we may approximate the eigenvector by means of a Slater determinant 
(3.2) of one-electron functions then we obtain an approximate rule which 
states that transitions which do not satisfy the selection rule in which only 
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one of the /,■ may change by either 1 or -1 will occur with small probability. 
This result follows from the fact that d is obtained additively from the dipole 
moments er t of the individual electrons and that, for a transition the matrix 
elements must have at least one which is nonzero and that from XIII, §3 
it follows that l must satisfy / -> / + 1 or l — 1. 

For the case in which the multiplet splitting is small it follows that the 
selection rules J -> J — 1, J, J + 1 and L-* L— 1, L, L + 1 and S -> S are 
satisfied and that the transition intensities for multiplet transitions can be 
determined on purely group theoretical grounds. We will now consider the 
transition between two parent terms with quantum numbers L, S' and L, S. 
Since it is possible that L = L' we will distinguish both terms by the use of 
indices N and N'. The eigenvectors of both terms will, according to (1.8), have 
the following form: 

$NLSm L m s ~ ~r = X ’Atfmx.fc u m s k ■ (5-2) 

v r k 

For the individual fine structure components J the eigenvectors will be given 
approximately by the correct linear combinations (2.1) as follows: 

FjVLSM = X tmjft S)<f> NLSm ,m s ■ (5-3) 

mL + ms = M 

According to XIII, §2 the matrix elements 

(Rh)nLSJM M = (Tn'L'S'M'i ^n^NLSMy (5-4) 

must be identical up to a factor P^'lsj J t0 the expansion coefficient of the 
product representation of 2> l x 20j with respect to the representation 
Therefore, using XI (10.17) we obtain 


(Rn )nlsjm — Pnlsj J U-m^m'J' — Pnlsj J J , ( 5 - 5 ) 

The intensity of a multiplet transition J' -+ J, L -+ Lis, according to XIII (1.5) 
proportional to [1/(2 J' + 1)] X*,m,m' \(RX'lsjm M ' \ 2 - From (5.5) we obtain 


X 


/r> \N'L'S'J'M' 
K^liJNLSJM 


Pnlsj J | X \ a Mn-,M’j' \ 2 ■ 


(5.6) 


Since a MwMV Ts a unitary matrix, we obtain = 1 and we there¬ 

fore obtain 


1 


x V I CP \N'L'S'J'M' |2 _ j -N'L'S'J' t 

-)r/ , i Aj I \*^fx/NLsjM I I Pnlsj I 

“T" 1 


(5.7) 


The intensities of the individual lines J' -> J for the same L —► Lare therefore 
given by the | p N N £sj J I (since S' = S). 

According to XIII, §2, using factors we obtain: 


RA. 


NLSmLms XK 

AT 


‘(ILK 


NLS 




L+ 1 S'm^+uMs 


+ ^ I imJJR)' z NLS lS( l ) N'LSm L + nms 
+ lS( t>N'L -lSmt + zzms)' 


(5.8) 
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We therefore obtain 


m L + ms = M N' 


R^HLSM ~ Z Yjdm L m s (L, S)< Z ^M L (^^) T NLS S( l ) N f L'SmL +fims f * 


Since, according to (5.3) and XI (10.19), we have 

4 > N'L'Sm I , + tints ~ Y ^m + Ws(L SWnL'SM + n (5.10) 

from (5.5) it follows that 

p N N L L s?'d J Mtl (J\) = T N NL L s S Y d J mLms (LS)4' mL (lL) d J mL+ms (L, S). (5.11) 

m L + ms = M 

If, in particular, we choose M — J;M' = J’ and therefore /i = ./' - i we will 
find that 

Pnlsj J djj -j(J, 1) = t^,i S s Y d J mLms (LS) d J mL ^j._j ms (L'S)dj,- r mi (1L). 

mL+ms=J 

(5.12) 

Thus, from the preceding result and XI (10.15) we obtain 

1 SJ+ 1 

Pnlsj 

KL+S- J)(L+S-J- l)(J - L+ S + 2)(J - L+ S +T) 
Tnls V (2J + 3)(2J + 2)2L(2L + 1) 


JV'L — 1 SJ 

Pnlsj 


_ -N'L — is 

— t nls 


N'L- 1 SJ- 1 

Pnlsj 


l(L+ S — J)(J — L + S + 1 )(J + L+S+ 1 )(J + L- S) 
L(2L+ 1)(2J + 2)2 J 


ivL- is /(T + £> + S + 1)( J + T+ S)(J + L— S)( J + L — S — 1) 
~ Tnls V 2L(2L+ 1)2J(2J — 1) ’ 


r .N'LSJ+ 1 

Pnlsj 


n'ls /(-^ H" S — J)( J + ^ + S + 2)( J + L— S + 1)(J — L+5 + 1) 

T'NLS / T S'! T I 'T't/'l T I T I ’ 


L(3L+2)(2J + 2)(2J + 3) 


n N'LSJ 

Pnlsj 


r»'Ls J(J + 1) + L(L + 1) - S(S + 1) 
TNLS 2VL(L + 1)J(J + 1) 


n N'LSJ— 1 

Pnlsj 


,, LS 1(1+ S - J + 1)(J + L+ S + \)(J + L- S)(J - L+ S) 
,LS J L(2L+ 2)(2J — 1)2J 
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Figure 19 4 D -<• 4 P transition for ionized carbon. 


We need not make a special computation of the values Pnlsj ISJ since from 


|/n \N'L'SJ'M' 
I yf'-fj./NLSJM 


\ = \(R») 


NLSJM | 
N'L'SJ'M’ I 


it follows from (5.5) that 

I Pnlsj J I + | = Ipivx'sj'l (5-14) 

and, for M = 0, M' = 0 and p = 0 we obtain 

I Pnlsj 1 Moo(^l) = IPivuf'Sjl d J 00 (J'\). 

Thus it follows that 

ip^ isj 'i = l ^\\ P N N ffi ls A (5.15) 

In Figure 19 the 4 D —<• 4 P transition of ionized carbon at 6800 A is described. 
Here we recognize the separation of the individual fine structure components 
corresponding to the interval rule (2.5). The relative intensities of the lines 
are indicated by the corresponding thickness. 
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6 Zeeman Effect 


The rotational symmetry of the Hamiltonian operator for an atom can be 
broken by an external field. For an atom in an external field there are 
additional terms obtained from H 0 in VIII (5.8) given by 


2m 


1 


k= 1 


Z (Pi k) - eA v (Q ik \ t )) 2 + e<p(& k \ t) 

-V= 1 


and an additional term from H 5 in VIII (5.8). If we consider a constant mag¬ 
netic field consisting of only a B 3 component, then the Hamiltonian operator 
will contain an expression of the form A x = —i B 3 x 2 , A 2 — jB 3 x u A 3 = 0 
for each electron. We therefore obtain 


+ ^B 3 QmJ + (pf -*B 3 Qfj 2 + P<‘> 2 ] 

= t [“ P (k)2 + PoB.Lf + ~ (Qf> 2 + e? )J )BlJ , (6.1) 

where Ho = — ejlm. From H s in VIII (5.8) we obtain the additional term 

HqB 3 = 2S 3 . (6.2) 

If we neglect the case of large magnetic fields, then it suffices to consider only 
linear terms in B 3 ; we then find it necessary to add the term 

H' = HoB.iL, + 2 S 3 ) (6.3) 

to the Hamiltonian operator for a free atom. If we compare (6.3) to XI (1.10) 
we find that, for electrons with spin (including the case of magnetic di- 
poleradiation), it is necessary to replace the magnetic moment m = HqL by the 
operator h 0 (L + 2S). 

Let denote the exact eigenfunctions for an energy level of a free atom. 
They span an eigenspace 2Tj which is invariant under the representation 
of the rotation group . The entire Hamiltonian operator including H' is, 
however, only invariant under the subgroup of rotations about the 3-axis. 
The eigenspace 2T } is not irreducible under the action of this subgroup. It 
decomposes into the 2 J + 1 one-dimensional subspaces which are spanned 
by the individual F J M , since, under rotations about the 3-axis each is 
multiplied by a factor Thus the external field breaks the (2 J + l)-fold 
degeneracy of the atomic energy levels. 

For the magnitude of the splitting we may only obtain simple assertions 
in the extreme cases. If the magnetic field is so weak that the splitting is small 
compared with the separation between individual fine structure components 
or adjacent energy levels of the free atom, then the splitting can be computed 
by perturbation theory. Here, for the eigenfunction we choose the “correct” 
linear combination which must be given by the T J U . From (6.3) we obtain 

H' = ho B 3 (J 3 + S 3 ). (6.4) 
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We obtain J 3 T J M = MF j m . The matrix elements <T^, S 3 must be deter¬ 
mined by group theoretical methods, because the theorems from XIII, §2 are 
applicable to the S k r J M and the J k T J M . Thus we obtain 

<Tm', s 3 r J M ) = j 3 r J M ) = xM 5 m , m . (6.5) 

Therefore H' has the form of a diagonal matrix (1 + x)M5 m . m with respect 
to T J M as a basis of the eigenspace that is, the splitting has the magnitude 

e M = (1 + t )Mg 0 B 3 = gMfi 0 B 3 . (6.6) 

The energy level with the total angular momentum quantum number J is 
therefore split into 2J + 1 equidistant terms corresponding to the different 
values M of the third component J 3 of the total angular momentum. 

The factor g = (1 + t) may be simply determined in the case in which the 
T J M are, in good approximation, given by the linear combinations (5.3) 

In the following we shall consider the operators S arnLL only in p for the 
computation of the value of g, that is, instead of S, L we shall consider 
PjSPj, PjLPj where P, is the projection operator onto 2T } . According to 
the above result, in ST, we obtain PjSPj = xPjJPj = xJPj. Thus in we 
obtain 


PjSPj -J = xJ-JPj = J- PjSPj = xJ(J + 1)P,. 


On the other hand, we obtain 

P = (J - sf = P - J -S - s J + s 2 . 


For the T^ of the form (5.3) we obtain L 2 T J M = L(L + and SP^ = 
S(S + 1)Fm. Since PjJ SPj = J ■ PjSPj = xJ(J + 1 )P } , we obtain the 
Lande factor 


9 


t , J(J + 1) + S(S + 1) - L(L + 1) 

1 + x = 1 + —---------- 

2 J(J + 1) 


(6.7) 


For singlet terms, that is, for S = 0 we obtain J = Land therefore g = 1. In 
Figure 20 the Zeeman effect for the sodium D line is illustrated. Not all the 
transitions M' -» M are permitted. Since, under rotations about the 3-axis, 
the R u R 0 , i?_, are multiplied by e~ ia , 1, e ia , only the matrix elements 
(R^m m are different from zero only for M' = M + g, that is, the selection 
rule M -> M — 1 ,M,M -I- 1, a result which we have already obtained in (5.5). 
For a transition J' -+ J we may now easily state the intensity ratios of the 
individual Zeeman components, because, from (5.5) it follows that 

(« M = Pnlsj J ( 6 . 8 ) 
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Figure 20 Zeeman effect for the sodium D line. 
We therefore obtain 


/n \N'L'SJ — 1M+ 1 _ N'L'SJ- 1 M + 1) 

Jnlsjm — Pnlsj J 2J(2J + 1) ’ 


\N'L'SJ-1M _ _ .N'L'SJ- 1 /(^ 


/n \N L SJ- IM _ .N'L’SJ 

v ^ o Jnlsjm ~ ~ Pnlsj 


J(2J + 1) 


/n \N'L'SJ- 

i Jnlsjm 


N’L'SJ- 1 M- 1 


Pnlsj 


N'L'SJ — 1 /(■/ + M)(J + M - 1) 


2J(2J + 1) 


/n \N'L'SJM+l _ _ N'L'SJ 

y^i Jnlsjm — Pnlsj 


/n \N L'SJM _ -N'L'SJ 

v^o Jnlsjm — Pnlsj 


KJ + M + 1 )(J - M) 


2J(J + 1) 


M 




/ p \N’L'SJM — 1 _ N'L'SJ + M X J M + 1 ) 

(K-iWm -p™ J 2J(J + 1) ’ 


/ n \N'L'SJ + 

y^i Jnlsjm 


Ld kJ*J I 

Pnlsj 


/n \N'L'SJ + IM __ .N'L'SJ + 1 
NLSJM — P NLSJ 


/n \N'L'SJ+ IM- 1 _ N'L'SJ + 1 

vA -1 Jnlsjm — Pnlsj 



+ M 

+ 

2)(J 

+ M + 

0 


(2J 

+ 

0(2J 

+ 2) 


/(J 

+ M 

+ 

1)(J 

- M + 

0 


(J 

+ 

1)(2J 

+ 1) 


/(J 

- M 

+ 

2)(J 

- M + 

1) 


(2J + 1)2( J + 2) 


(6.9) 
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It is easy to show, on the basis of XI, §1, that the R lt jR_, correspond to 
circular polarization of the radiation observed in the direction of the 3-axis 
(a component) and R 0 corresponds to a linear polarization orhtogonal to 
the 3-direction observed in the direction of the 3-axis (n components). 

If the spin perturbation is so small that the for the free atom are given 
by the linear combination (5.3), and the magnetic field B, is so strong that 
its perturbation is no longer small compared to the spin perturbation, then 
both perturbations must be handled at once, that is, we must proceed from 
the parent terms with the eigenspace t LS spanned by the (j) mLms . Let H" denote 
the additional term of the Hamiltonian operator which takes spin into 
account. For the linear combinations T J M we therefore obtain H'T J M = 
f[J( J + 1) - L(L + 1) - S(S + 1)] according to (2.3). On the other side, 
it is clear that 


H'4> mL . ms = ^oB 3 (m L + 2m s )(l> m L ms ( 6 . 10 ) 

and therefore 

HT’ m = h„B 3 Z (m L + 2rn s )d^, ms 0 mtms . (6.11) 

wi£, + ms = M 

With 


</v 


= y d J r J 

Lu m^ms m L + ms 


we therefore obtain 

HTit = h 0 B 3 


Z 

fUL + ms - M J' 


Z (m L + 2m s ) d J mims 


( 6 . 12 ) 


For each M = — J, — J + 1,..., J it is necessary to solve the secular 
determinant for the eigenvalues s M of the perturbation (with roots s ML+s , 
Zml+s -i , • • •, %|l-si corresponding to the values J = L + S,.. ., \L — S|): 

|{f[J(J + 1) - L(L + 1) - S(S + 1) - e M }8jj. | 

+ Ro B 3 Z (™l + 2 m s ) d J mi 


ms + rtiL = M 


d J ' 

*m L ms ^mjjns 


= o, 


(6.13) 


where the matrix has the indices J, J'. If we multiply this matrix on the right 
and on the left by the unitary matrices a uv .. Mr from XI (10.17), we obtain the 
equivalent secular equation 


Oo B 3 {m L + 2 m s ) - e M ld msfls d mLllL 

+ f Z + i) - l(l + i) - s(s + l)] d’^ d J muns ~ u ’ 

j 

where m L + m s = n L + n s = M and the pairs m L , m s ; n L , Us arc to be con¬ 
sidered as the matrix indices. 

For very small B 3 from the first secular equation it follows that 


s M j = r[J(J + 1) - L(L + 1) - S(S + 1)] 

+ Mo B 3 Z ( m L + 2m S) d mortis dyn^ms ’ (6.15) 

wj J + ms = M 
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where the latter again leads us to the result (6.6) with the Lande g factor (6.7). 
Conversely, if f ^ p 0 B 3 , then, from the second form we obtain 

% ms = P o B i(M + ms) 

+ f[-L(L+ 1) - S(S + 1) + I J(J + 1 )ti- msms d J M -m sms l 

J (6.16) 

The essential contribution gives p 0 J? 3 (M + m) for the splitting. Each of the 
Zeeman components arising from the gross structure energy level exhibits a 
fine structure splitting f [• • •]; without the term proportional to f, all energy 
terms with the same sum M + m s = m L + 2m s will coincide. 

The transition from the first to the other extreme case is called the Paschen- 
Back effect. 

For the case of the alkali metals (S = j, and therefore J — L + j, L — j) 
(6.13) and (6.14) are two-dimensional secular equations. We can solve these 
in an elementary way, and discuss the splitting as a function of B 3 (for fixed f). 

7 / Electron Problems and the Symmetric Group 

Until now we have not investigated the representations of the permutation 
group in Jt?{ because the coupling of these representations with spin angular 
momentum has made it possible to solve most problems of interest without 
requiring an explicit knowledge of these representations. In order to make 
the behavior of the energy levels more evident for the case of / electrons we 
will now examine the representations of the permutation group in more detail. 

Let i be an n-dimensional vector space (not necessarily a Hilbert space). 
We shall now investigate the representation of S f in the space ST — t f . If 
(p v is a basis of t, then the vectors of T are given by 

<£= I ■ ■ ■ <Pv f (J)- (7.0 

Vi •••V/ 

P(f> is defined for a permutation P in XIII, §1. The representation of Sy in ,T 
may be completely reduced (see AIV, §14 and AV, §10.6). We can adopt the 
basis vectors of 3~ for this reduction in the form (see AIV (14.7)): 

ST = £ + / v) x .* <v > (7.2) 

V 

in order that the transformations of S f have the following form: 

p = Z <2v(l X P^)Q V . 

v 

The Q v are the projections onto the subspaces / (v> x 4 <v) of ,T. The P (v) yield 
irreducible representations of S f in 4 <v) which are not isomorphic for different 
v. With a basis u*, v) of /> W) and v ( J ] of t (v) we have u ( ^ = as a basis for .T. 
Let /D'* = ^ <V H V) and l { p = u<, v V v) . ft? is spanned by the u { p a (p fixed) and 
l { p by the u^„ (a fixed). The transformations A in .T which commute with all 
the P in S f are of the form £ v Q V (A (V> x l)Q v where the A (y) are arbitrary. We 
shall denote the ring of such transformations A by (LS). Thus ST is completely 





152 XIV Spectra of Many-Electron Systems 


reduced by (7.2) relative to the ring (LS). According to AV, §7 all operators 
which commute with all operators from (LS) are given by the elements 
£ P a{P)P of the group ring of Sy. 

The transformations from (LS) will be called symmetric transformations. 
From 


A(p Vl (l) ■ ■ ■ cp Vf (f ) = £ <P M 1 (1) • • • ^ / (/K 1 ... w ,„ 1 -v / (7-3) 

we easily see that, for symmetric transformations, 

— a n X i--fi «/;»<!"■»«/ (7.4) 

holds, that is, the matrix elements are not changed if we subject the left and 
right sets of indices to the same permutation. 

Let J£„ denote the ring of all linear transformations in 4 . Then we get a 
representation of £ n in £ which we denote by £{. £{ is then a part of (LS) 
because if a is an element of £„ which satisfies cp v = £„ <p„ then a is an 
operator in £ defined by 

a<p Vl (l) • • • <Pv//) = £ </>„,(!) • • • ^ / (/K 1 v 1 «„v 2 •' • <Wv/ (7-5) 

and (7.4) is satisfied. We now assert that the operators in (LS) can be expressed 
as linear combinations of operators in £{. Instead of the two indices /r i; v f 
we shall write Z ; as an abbreviation. We shall abbreviate the sequence 
l j, Z 2 ,..., If by the index /.The matrix coefficients a lll ... llf3Vl ... Vf = a h ...i f = a t 
of all symmetric transformations (as a sequence of numbers) form a finite¬ 
dimensional vector space, a subspace of the vector space of all transformations 
fl /uviThe assertion is therefore that the special vectors 
a; = a h a l2 ■ ■ ■ a,„ of the transformations from S£{ span the subspace of all 
symmetric transformations. The subspace spanned by all the a, is given by 
all the vectors b t which satisfy £j x l b l = 0 if £, x,a, = 0 is satisfied for all 
a, e £{. If we show that £, x r x, = 0 implies that £, x t a t = 0, the theorem is 
proven. 

From 

£x,oc ( = X x llh ... lf a h a l2 - ■ ■ a lf = 0 (7.6) 

l h"lf 

for arbitrary a t it follows that all coefficients of the same power in a u in (7.6) 
must vanish. A particular member a h a h ■ ■ ■ a u repeatedly occurs in (7.6), and 
its coefficient is equal to 

EiX-ir (7-7) 

p 

This coefficient must be equal to zero. The elements a ili2 ... (/ of (LS) satisfy 
Pa u ... lf = a h ... lf and we obtain 

£ x h - i f a ii -i f ~ 0- (7-8) 

h - If 

Since all elements of (LS) are linear combinations of elements from the 
/ <v) are also irreducible as representation spaces of £{. 
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The elements of (LS) are not only linear combinations of elements of S? } „ 
but also of U{ where U„ is the group of all unitary transformations in t. We 
can also impose the condition that the determinants of the elements of U„ are 
equal to 1; these groups are then called SU„ (the special unitary groups). 

Proof. Let a vp be the matrix of an element from U„. The condition \a vp \ = 1 does not 
play an essential role; if |a VJ1 | = a then s = 1. Since the form £,x,a, is 

homogeneous, we may neglect the factor a 1/n . From 

V ~ X X hl2 'lf a li a h - a if ~ X */*lVlM2V2"'M/V/ a MlVi a /l2V2 ' ’ ' a HfVf 
l fiv 

= 0 (7.9) 

it follows that 

_ 3a> 

E ~~ da„, = 0. (7.10) 

fl\ 

Since the matrix a = (a pv ) is unitary, from (a + da) = a(l + a + da) and 

(a + da) + (a + da) = 1 where da = iada, it follows that (1 — i<5a + )(l + iSa) — 1 

and that da is a Hermitian matrix. From (7.10) it follows that 

dtp 

= o, (7- 11 ) 

vnp 

where 5a = ( 5a pv ). If B pfl = Ev (o<p!da vfl )a vp , then for arbitrary Hermitian 5fi pp it 
follows that 

E B pMph = °- 

PM 

If the 5p pii are completely arbitrary we find that 

5\r = 5p pp + Ip pp and 5 2 cc pp = i(5p pp - 5j pp ) 
are Hermitian matrices. Thus, for arbitrary 5[] pp it follows that 

E B eMw = °» 


that is, the B pp = 0. We therefore obtain 

v d( P n 

v oa vp 

Since the matrix a vp (as a unitary matrix) is not singular, it follows that 


Here we note that 8(p/da w is homogeneous polynomial (like <p) but of degree 
/ — 1. By induction it finally follows that tp = 0, that is, the coefficients x, r .. lf must 
satisfy the same relationships (7.7). 

The // v) are therefore also irreducible subspaces for the representation 
(SU n ) f of SU„. If the space i is a spin space, then ( SU 2 Y is the representation 
(£^i/ 2 Y of the rotation group. Thus we have proven the theorem used in §§1 
and 3 that for the spaces in (1.3c) for / electrons the representation A' of 
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the permutation group uniquely corresponds to the spin angular momentum 
quantum number S. 

The reduction of 0~ in the form (7.2) is closely connected to the group ring 
P# Sf . Instead of Pt Sf we shall consider the ring PA* of all transformations which 
commute with ( LS). 01 Sf is homomorphically or isomorphically mapped onto 
01*. 01* is the ring of all transformations of the form <2„(1 x B (v) )Q v . The 
ring 0t* may be easily decomposed (for the general case see AV, §7) into left 
and right ideals as follows: All elements of 01* may be written as follows: 

I (7-14) 

vpa 

where = d Vfl S aa 'uP P . For fixed v and a the e*ft form a basis for an 

irreducible left ideal /* {v) of PA *; for fixed v and p the e*P form a basis for an 
irreducible right ideal t* M . The eftft are idempotent elements and /* (v) = 
P2*e* (v) , i* iv) = e* p p ) 0l*. The i* (v) are uniquely related to the ftp — ft^vft = 
>i*ft ] pr = e*ft ] 0~, a result which follows directly from the operators e*P 
described above. The decomposition of the unit element in 0t* into irreducible 
idempotent elements e = v eftft yields such a decomposition of 0~ into 
invariant irreducible ftp with respect to (LS), SPft U{ and ( SU n ) f , respec¬ 
tively. 

If the dimension n > f then the vectors P<p Vl (l)<p V2 (2) ■ • • tp Vf (f) (v ( # v k ) 
(where the P are all elements of S f ) span an (/ !)-dimensional subspace of 0~ in 
which the representation of S f is isomorphic to the regular representations 
of Sy in 0t Sf . 0t Sf is therefore isomorphic to the ring 01* because not more 
than one element of S f can be mapped onto the unit element of 01*. Thus it 
follows that ftp = Pp.T = e { p.T ^ (0) where eft, are irreducible idempotent 
elements and Pp are irreducible right ideals of Pt Sf - 

If n < f then 0t Sf is only homomorphically mapped onto 01*. The elements 
of P$ Sf which are mapped onto the null element of 01* clearly form a two-sided 
ideal y. From 0 Sf as an additive group with operators we may construct the 
factor group (factor space) 0tsj?. It easily follows that = &*■ 9 is 
characterized by y.T — (0). Since is completely reducible it follows that 

0^s f — 9 + A 

where t is clearly a two-sided ideal and t = 0 Sf l9- Only the irreducible right 
ideals lying in t and its idempotent elements eft, gives rise to invariant sub¬ 
spaces ftp = eft r T. If the irreducible right ideal Pft lies in t then all the 
ifteft = tft lie in / because 9 is a two-sided ideal. Thus for a given v either 
all equivalent irreducible right ideals (and therefore all eft (v fixed)) lie in 
/ or none do. 

The inequivalent idempotent elements of Pi Sf are, according to AV, §9, 
given by a tableau T k as y k I k K k . We may set 

efl = y k hK k . (7.15) 

(The two-sided ideal spanned by the tft (k fixed) consists of all elements of 
the form Pi Sf ef\Pi Sf .) From the meaning of K„ it follows that ef\ 9~ = 0 if, 
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in the tableau T k there are more rows than the dimension of t since we would 
then obtain K k ST = 0. We therefore obtain a sufficient system of idempotent 
elements if we restrict ourselves to tableaux with not more than n rows. 
Other representations of S f which do not belong to these tableaux cannot 
occur in .T. 

If 4 is the two-dimensional spin space, then in i f we find only the representa¬ 
tion of Sj which belong to tableaux with at most two rows. We shall now 
describe how we may explicitly obtain the spin function matrix of the u^l for / 
electrons. For this purpose we shall consider such a tableau with at most two 
rows. The first row contains g positions more than the second, that is, the 
first row has l + g, the second has / where 21 + g = f. We seek to obtain the 
subspace ftp of which corresponds to this tableau. Such a ^ can be 
obtained from the idempotent element which corresponds to the tableau. 
On the basis of the structure of this element, as described in AV, §9, we are led 
to the conjecture that a space /j, v) is obtained from the “spin invariants” 

i 

A = n [m + (v)u_(Z + g + v) - w_(v)w+(/ + g + v)] 

v= 1 

• n [u+(l + n)x + «_(/ + g)y~\ (7.16) 

by means of the coefficients v s M of 


x S + M y S-M 


/(S + M)\(S - M)! 


where 2 S — g, M = —S, — S + 1,..., S; for if we apply the permutation s 
(where s is given by AV, §9) we then obtain sA = (— 1)M and we find that 
KA is a multiple of A. If we then apply a permutation r then we find that A is 
not mapped into itself, and that we may apply a symmetrization / = r to 
A. This is, however, not necessary because, by analogy with XI, §9, it is easy 
to see that under rotations the coefficients of Xf, transform according to & s . 
Thus we have indeed found a space From the v s M (M fixed) we obtain an 
entire space which, transforms according to the corresponding representa¬ 
tion of S f when we subject v s M to all permutations. This follows from the fact 
that the vector Iv s M = (JT, r)v s M must also be in such a space 2$ and that 
the 2$ must be irreducible, because they correspond to a spin quantum 
number S. If we wish to find the representation <2> s and the corresponding 
representation A of S f only in a formal way, then in the expression (7.16) for 
A we may formally set w a (l) = u x ( 2) = • • • uJJ + g) = and u^l + g + 1) = 
M ct(Z + g + 2) = • • • = u a (2Z + g) = v a . If we then apply J\ r r to A we will 
obtain a multiple of A. 

We will now use the above results to discuss the coarse structure of energy 
levels associated with an electron configuration, without using the spin 
functions, as we have done in §3. We will directly investigate which spaces 
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/ v) x 4 <v) in (7.2) can arise from an electron configuration. Let the initial 
electron configuration energy level be given by 

s = aifWi + a 2 £ »2(2 + •' • + (7.18) 

and the corresponding electron configuration is given by 

(n l l i r(n 2 i 2 r---(n s l s r, (7.19) 

where a x + a 2 + • • • + a s = / In the Hilbert space of an electron (without 
spin) let 

* ~ "Wx © ^2*2© ' ‘ ‘ © K s i s > (7.20) 

where t njli is the eigenspace corresponding to the eigenvalue s n<li . We now 
choose ST = t f . does not only include the above term e in (7.18) but also 
other combinations of the s„. u . We will now see that it is easy to choose the 
subspace in ZT which is the eigenspace corresponding to the energy level 
(7.18). In ZT we may think of S f as being reduced so that we obtain the form 
(7.2). The spaces /. <v) x t <v> are also uniquely characterized by the representa¬ 
tions of U„. All vectors of the same correspond to the same eigenvalue of 
H 0 (neglecting the Coulomb interaction of the electrons), that is, a 
corresponds either to the entire energy level (7.18) or does not correspond 
at all. What is the multiplicity of the energy level e (7.18), that is, how many 
of the 7 ( p v) correspond to s? To answer this question we consider the group 
U„ in i. Let u v be a basis in t which corresponds to the decomposition (7.20). 
All elements of U n are conjugate to the diagonal transformation 

Au v = e v u v . (7.21) 

According to (7.18) all of the eigenvectors u vi (l)u v2 (2)... u v/ (f) with eigen¬ 
value e will be multiplied by f] v e v in where the s v corresponding to the 
vectors u v in occur with multiplicity a ( . If X (v) (e 1 , e 2 , ...) is the character 
of the representation of U„ in the / (v> , that is, in the ^ (v) of / <v) x t (v) then X (v) 
is a homogeneous polynomial of /th order in the e v . Thus we may uniquely 
decompose X (v> into X ( 1 V) + X^ where X[ v> contains all members of X ( ' :) 
which are of order y. t in the s v corresponding to the vectors u v in t„ iU . Thus 
A ( ! V) (1, 1,...) is the multiplicity with which the term e occurs in /, <v) . 

We will now proceed from U n to the subgroup of rotations, the representa¬ 
tion of which decomposes in the irreducible representation 

+ ••• + 

then the / (v) are no longer irreducible. The subspace o (v) of / <v) corresponding 
to the energy level s is an invariant subspace with respect to the representation 
of the rotation group because, under rotations, the vectors of the i niU transform 
only among themselves. The character for the representation of the rotation 
group in <Z X) is then equal to e 2 ,..., e s ) if we replace the s Vk corres¬ 

ponding to the vectors « Vl , m V2 , ..., u V2l+1 of by e“f, e ill '~ 1,a ,..., e~ i>a . 
Setting e = e ia we then obtain X ( x } (s u e 2 ,..., e s ) = T (v) (e). Now, in order to 
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reduce the rotation group in /. <v) we need only decompose Y' v) (e) into 
irreducible characters 



We set (s — 1) Yjm= -l gm — e L+1 — £ L — <L). It must therefore be possible 
to write 

(«- l)F (v) (a) = £a L <L>, (7-23) 

L 

where the a*, are integers and a*, > 0. Then the a L are the multiplicities with 
which the energy level with orbital angular momentum L occurs in the 
/t v) corresponding to the (v)th representation of S f . From (7.23) it follows 
that 

Z a t< L > = £ a r(s L+1 “ e _L ) 

L L 

= a 0 s + ajfi 2 + a 2 e 3 + • • • —a 0 — a 1 £~ 1 — a 2 e~ 2 — • • •. 

Therefore, for y <v, (e) = X!v b v £ v we obtain 

a L — b L — b L+ 1 = — b_ (L+1) . (7.24) 

The determination of the gross structure terms (the so-called parent terms) 
arising from an electron configuration is, in principle, solved in this way if 
we know the characters of the irreducible representations of U n in ^ <v) which 
uniquely correspond to an irreducible representation of S f , that is, uniquely 
correspond to a tableau in AV, §9. We will seek these characters in the next 
section. We will now show that the representations in t <v) associated with the 
representation in spin space are given by a tableau (AV, §9) having two 
columns—more generally: Two associated representations belong to two 
tableaux which are obtained from each other by exchanging rows and 
columns. 

Proof. If e = Y, P e (p)P is idempotent and generates the irreducible left ideal l in 3t Sf by 
means of the equation — l, then the trace of an element q in the representation 
given by l is equal to the trace of the transformation x -> qxe in the entire 0t %! because 
it is identical to the null transformation outside of l and within l from xe = x is 
identical to the transformation x -» qx. We therefore obtain: 

x(.q) = (7-25) 

t 

According to AV, §9 the idempotent element corresponding to a tableau has the 
form e = y 5Z r r (— l) s s = y^,(- l) 5 rs. The idempotent element corresponding 
to the tableau obtained from the previous one by interchanging rows and columns 
can be chosen as follows: 

= (-l)V = y I (- iyv- ■V- 1 =?!(-mrsy (7.26) 

rs r's' rs 

Thus we find that e*(p) = ( — 1 ) p e(p~ *); from (7.25) we obtain: 

x *(q~i) = x *(q) = (-1 )«x(q). 
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We will now simplify the search for the gross structure of the energy levels 
by considering the composition of two groups of electrons. Let us consider 
two groups consisting of and f 2 electrons where / = /i + f 2 ■ We shall 
decompose t f as follows: i f = t fl x t f2 . In the group S f we shall consider 
the two subgroups <r u the permutation group for the J\ electrons and a 2 the 
permutation group for the f 2 electrons, respectively, and the subgroup 
(ji x <r 2 containing all mutually independent permutations of the f x and f 2 
electrons. 

In (7.2) the spaces t (v> , under the restriction of the group S f to the subgroup 
<r, x a 2 are no longer irreducible; instead, the representation of a 1 x a 2 in 
4 <v> is split into irreducible components, where each component is of the form 
A x * A 2 where A x and A 2 are irreducible representations of <7, and a 2 , 
respectively. Not all transformations which commute with the operators in 
<xi x a 2 are of the form Q v (A (v) x 1 )Q V , because equivalent representa¬ 
tions of cr 1 x a 2 can occur in different t (v \ For this reason we find it necessary 
to consider the groups a x and U{' in .T x = t fl which permit a decomposition 

x = Z /3 + A P) x i<P> similar to that in (7.2). The ftp are irreducible 
relative to U n and the t ( p are irreducible relative to a x . We shall also do the 
same for i f2 = 9~ 2 — + Ai x It follows that the decomposition of 

,T is given by 

r = E + (A p) x A° ] ) x (A a) x (7.27) 

pl«T 

The representation of a y x a 2 in i { { ,) x i 2 ] is irreducible and has the form 
Aj * A 2 . All A which commute with a x x cr 2 are of the form 

a = QM p ' a) x m P o 

where A (pa> are arbitrary transformations in /t (p) x U„ is represented in 
/ <p) x /t (a) by the representation J^ ip) x .# (<T) where ,# <p) , are the 

irreducible representations of U„ in / ( 1 P) , fc { 2 \ respectively, if (p) x is not 
necessarily irreducible; if reducible then it can be decomposed into several 
irreducible components, each of which is uniquely related to a representa¬ 
tion of S f in as follows: ftp x ftp — ft v) with some of the ft v) in (7.2). 

If we compare the two decompositions of (7.2) and (7.27) we can make 
the following conclusion: If the irreducible representation A (v) of S f in t (v> 
contains the irreducible representation A[ p> * A ( p of a x x a 2 b times, then 
the reducible representation Jf ( p x of U„ corresponding to A^* x A ( 2 0) 
contains the irreducible representation of U„ in ft y) corresponding to A (v) 
exactly b times. 

We will now show that this fact permits us to obtain the energy levels 
arising from a given electron configuration in several steps (in §3 a different 
method was outlined). In the first step we treat a collection of equivalent 
electrons as a whole. In the second step we combine groups of equivalent 
electrons. 

If, for a given electron configuration, we have two groups of electrons for 
which no electron from the first group is equivalent to an electron of the other 
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group, we may then make use of the subgroup a 1 x a 2 instead of the group 
S f . An irreducible representation A, * A 2 of a l x a 2 in t{ occurs in the 
representation A corresponding to the spin quantum number S either once 
or not at all, depending whether S occurs in the values 

Si + S 2 , Si + S 2 — 1,..., (iSj — S 2 1, 

where Si, S 2 are the spin quantum numbers corresponding to A t and A 2 , 
respectively. Since the associated representations are essentially the same, 
except for a factor of ( — 1), it follows that the representation A' of S f in the 
orbit function space t f contains A\ x A' 2 with the same frequency as A 
contains the representation A x x A 2 . 

We may now use this result to find spaces of the form / ( 1 v) x and 
//^ x iff for the two groups of electrons separately which correspond to the 
representations A' l5 ,@ Ll and A 2 , Q) L2 . Since none of the electrons from the 
first group is equivalent to any of the electrons in the second group, we may 
construct the product space ftp x ftp x 4 ( 1 v) x ftp corresponding to the 
representations A' t * A 2 and S) L{ x 3> Ll . To A' t and A 2 there are corres¬ 
ponding representations A t and A 2 in spin space with spin quantum numbers 
Si and S 2 . As we have seen. A', * A 2 occurs exactly once in the representa¬ 
tion A' of S f providing A corresponds to one of the values 

5 = Si + S 2 , Si + S 2 — 1,..., |Si — S 2 |. 

For a given A the energy levels are split by the Coulomb interaction into 
different energy levels with different orbital angular momentum 

L = + L 2 ,Li + L 2 — 1,..., |Li — L 2 |. 

Thus we have shown that the study of atomic energy levels may be reduced 
to the study of groups of equivalent electrons, as described in §3. 

The above considerations permit us to obtain a rough picture of the nature 
of the chemical bond, that is, of the processes by which atoms bind to form a 
molecule. Let us consider the atoms at large distances being slowly brought 
together. The energy levels of the electrons in the field of the nuclei depend on 
the relative positions of the nuclei. If, for a certain position of the nuclei there 
is a relative minimum for the energy, then the atoms may then combine to 
form a molecule. A more precise determination concerning the possibility 
and the type of such a bonding requires a detailed quantitative evaluation of 
the energy levels (see XV, §5). We shall now provide a rough description: 

Suppose that the ground states of the atoms belong to the representations 
Ai, A 2 ,... of the permutation groups of its electrons. Let their corresponding 
spin functions be characterized by their spin values S u S 2 ,..., and their 
representations A 1# A 2 of the permutation groups. Then, as the atoms are 
brought together these ground states give rise to energy levels of differing 
symmetry A', namely those A' found in the representations A' t A 2 • • •. 
According to the results obtained above, only such A' can be found which are 
associated in the spin space with the representation A corresponding to 
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the spin numbers S which are obtained from the reduction of 3> Sl x 2ft Sl x • • •. 
If we call = 2S, v 2 = 2 S 2 ,- ■ ■ the valence of the atomic states, then we 
obtain the possible valences of the molecule directly from the combination 
possibilities for the valence lines, a problem which is equivalent to the problem 
of the reduction of 2ft Sl x 2ft Sj x • • •. If we then apply a perturbation com¬ 
putation by analogy with the computation presented in §9, we will find 
that, for the most part, the lowest energy level is the state in which there are 
no free valence lines v = 0. We will present a detailed discussion of this topic 
in XV, §7. 

8 The Characters for the Representations of S f and U n 

In the space y = t f we shall now compute the trace of a transformation PA 
where A, P are elements of U n and S /; respectively. For A let us choose a 
diagonal representation 

Acp v = e v <p v , (8.1) 

where <p v is a basis for For .T we may choose two different systems of basis 
vectors: First the <£ V , V2 ... V/ = <p Vl (l)<p V2 (2) • • • q> Vf (f) and second, the 
obtained from the decomposition (7.2). Here we obtain 

PA(p Vl ... Vf = fi Vi e v2 ' ' ’ £ V/^ > V ai Va 2 -"V ot/ ' (^-2) 

We will obtain diagonal elements only if the permutation P of the indices 
Vi- - -v f does not change the sequence of the indices. If P # e this can be only 
if some of the indices are identical. Which of the index sets v t • • • v f will be 
transformed into itself under P? Let P be described in terms of a cyclic 
representation (see AV, §9) where y u y 2 , ■ ■ ■ are the lengths of the cycles. 
Here we find that P transforms an index set into itself if the individual cycles 
permute identical indices. The vectors which are transformed by P 

into themselves form a basis for the vector space 

x x •••, (8.3) 

where, for example, i, n is spanned by the vectors </> v (x 1 )<p v (x 2 ) • • • (p v ( x yi ) 
where x l5 x 2 , ■ ■., x yi are the numbers in the cycle y , of P. Therefore we obtain 

tr (PA) = tr yi (A) tr n (A) • • •, (8.4) 

where tr n (A) is the trace of A in t yi . It is easy to see that this is equal to 
s yi = Yj"= i e l l - Thus we find that 

tr(PA) = s n s y2 ■ ■ •. (8.5) 

If we use the basis u^v^, we obtain 

Au<;V a " = X (8.6) 

A 

and we find that 

PAu^vy = X u^W x y;>P ^. (8.7) 

Xp 
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From these results, we find that 

tr (PA) = 11 = X x (v> (P)X (v) (8 1 • • • £„), (8.8) 

v an v 

where x (v) is the character of the representation (v) of S f and X (v) is the 
character of the corresponding representation of U„. 

From (8.4) and (8.8) we therefore obtain the important relationship: 

Yx (v W M (e l ---s n ) = s 7l s 72 ---. (8.9) 

V 

If we are successful in computing the ••■£„) then we may also compute 
the y <v) (P) from (8.9) by using the orthogonality relations (AV, §10.5). Since 
A is unitary, e v = e iXv where a v is real. Since all A of the form (8.1) are diagonal 
matrices, they commute; thus we may choose the basis vectors u K v) in // v) (7.2) 
such that the operators A are also diagonal in the representation in /X'K 
Instead of (8.6) we may express the operators A as follows: 

(8.10) 

with a v from (8.1). 

Since this provides a representation of A, it follows that if Bcp v = e Wv (p v , 
then from AB(p v = e i(tt ' ,+Pv) (p v it follows that 

/i v) («i • • • tO/i v) 03i •••/?„) = /i v) («i + + Pn)- (8-11) 

Thus it follows that if 

^(a,) = /f(0,0,...,a„0---0), 

fkX ai, •••,«„)= n 
1 = 1 

gVM, + Pd = gtiWdXPd, (8.12) 

and 

g&( 0) = i- 

Since the g ( u( r x t ) are periodic in a, and have magnitude 1 it follows that 
gu (ad = e ihm where h, is an integer; we therefore obtain 

/r(a 1 ,...,a n ) = ^=^‘. (8.13) 

X <v) (ej • • •£„) is therefore a sum of positive and negative powers in the 
£ v = e ictv with positive integer coefficients. Since two elements for which {e v } 
differ only in the order belong to the same class of conjugate elements, 
X (v) (£ 1 • • •£„) must be a symmetric functions in the arguments e v . 

As we have shown in AV, §10.5, the “volume” element for a class of con¬ 
jugate elements for a k between a k and a k + da k is given by 

c | A | 2 doc 1 da 2 ■ ■ ■ da„, (8.14) 

where A = ]^[ i<fc (8 ; — e k ) and c is the normalization factor, satisfying 

c J|A| 2 dot 1 • • • da„ = 1. (8.15) 
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We obtain 


1 

b l 

p n- 1 
e 2 


e"r 2 

er 2 


£ 1 1 

1 


p n 1 p n 2 ... p 1 

C„ O n 1 

^ 1 l)ai + («“2)a 2 + ■••+a„-i + Oa„] 


(8.16) 


where P permutes the a t • • • oc„. If we have an arbitrary sum of powers of the 
form 


•••«-) = (8.17) 

p 

where > h 2 > • • • > h n , then, it is easy to see that 


J Pht-hjfci ' ‘' “»)P«■•■«,(« 1 •••«») ^« 1 • •' 

_ fn!(2 tt)” \ih k = h k ., (8.18) 

(0 otherwise. 

Thus it follows that c = (n!) _ 1 (2?t) _ " in (8.15). According to AV, §10.5 for 
the characters A <v, (e, •••£„) we obtain the orthogonality relations: 


n ,j 2nyi J AX^AXM da,--- da k = <S V „. (8.19) 

Instead of using X (v> we shall use 

T <v) = AX (V> . (8.20) 

The T <v) (£ 1 •••£„) are therefore antisymmetric power sums in the s v with 
integer coefficients a hl ... hn . We obtain 

r <v) (£i •••£„)= £ a hl ... hn p hl ... hn (tx, ■ ■ ■ a„). (8.21) 


J|r (v) | 2 da, • • -rfa„ = n\(2n) n 

it follows that 

I K-d 2 = I- (8-22) 

hi -h n 

Since all the a hl ... hn are integers, only one of these a hi ... hn ^= 0 and is therefore 
equal to 1 or — 1, so that for a suitable series h,---h n \ 


r (v, ( ei •••£„)= ± Phr-hS* 1 • • • “»)• (8-23) 

The coefficient of the term e‘ Eli= ‘**® k in T (v) must be positive, because the term 
arises as the product of the term e ££ fc= 0 f ^ anc j a term 0 f th e power sum 
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X <v) (£, •••£„) having only positive coefficients. Thus we find that we must use 
the + sign in (8.23). Thus we obtain 

x (v) ( £l •••£„) = A-V*,...,„(«,•••«„). (8.24) 

Since the above results (from (8.10) on) hold for any irreducible representa¬ 
tion of U n (because in (8.10) we have not made any assumption about the 
irreducible representation space / <v) ) all irreducible characters have the 
form (8.24). 

Since, according to (8.17), (8.18) the A - l p hl ... hn are orthogonal for different 
h l > h 2 > ■ ■ ■ > h n , the A - l p h i ... hn comprise all possible irreducible charac¬ 
ters for the group U n . The A" l p hi ... hn must therefore form a complete system 
of functions with respect to the classes (that is, as functions of the oq • • • a„) 
(see AV, §10.5). This general group theoretical fact is another proof of the 
Fourier theorem: the set of all e~ with integer n, form a complete system. 

Have we obtained all possible irreducible representations of U n by the 
process of reducing the representation (for all /) ? To answer this question 
we must return to (8.10) and ask how the sequence h l > h 2 > ■ ■ ■ > h n 
depends upon the index (v), that is, upon the corresponding representation 
of the permutation group and therefore upon the tableau corresponding to 
this representation (see AV, §9). 

We shall now seek a basis vector u jt v) in // v) (/ (v) may, according to §7, 
correspond to an indempotent element of belonging to a tableau with 
row length n 1 ,n 2 ,. . . ,n k ) for which the exponents k v in 

M v) = ft# 

form a “largest” possible sequence k u k 2 , ..., h„ [A,, k 2 ,..., k n is said to be 
“larger” than k\, k ' 2 ,..., k’ n if k k > k\ or if k 2 > k' 2 for k t — k\, etc.]. Since 
the vectors in / <v) may be obtained by application of the idempotent element 
yJK to the vectors of ftp is spanned only by vectors of the form 

ylKcpft 1) • • • (p V f (f), 

where the groups of the same Vj do not have lengths longer than 
n u n 2 ,..., n k , otherwise the application of K would result in 0. Thus it 
follows that the “largest” possible sequence k u k 2 , ..., k„ is given by n u 
n 2 ,..., n k , that is, k t = n t . 

Since T (v) = AX <V) then from (8.16) and (8.24) it follows that the following 
relations hold for the hp. 

hi = n k + (n — 1), h 2 = n 2 + (n - 2), • • •, h n = n„, (8.25) 


where we set n v = 0 for v > k. 

According to §7, for sufficiently large n, we obtain subspaces ftp which are 
nonzero in i f for all tableaux and we therefore obtain all possible decom¬ 
positions f = n k + n 2 + ■ ■ ■ + n k where n v > n 2 > ■ ■ ■ > n k . The values 
h, = n t + (n — V) do not, however, run through all possible sequences 
hi > h 2 > ••• > h n . 
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If we replace the representation A -* Be U{ by A -> \A\ l B where \A \ is 
the determinant of A and / is an integer (/ = 0, ± 1, ± 2,...), then in the space 
/ <v) we obtain (for fixed /) a representation with the character | A |'2T <v, (e, • • • e„). 
If the highest powers of X (v) are given by the k u k 2 ■ . ., k„ then the | A |'X (v) 
may be obtained from the series fe, + /, k 2 + k n + l. By a suitable 
choice for / we can therefore always obtain positive values of the fe, and, for 
sufficiently large / we may set the k t = n ;. 

We therefore obtain all irreducible representations of U„, reducing 
U J „ (for all / > 0) and multiplying the irreducible parts of U{ by powers of 
the determinant. In particular, we find that the reduction of ( SU„Y contains 
all irreducible representations of SU n , because the determinants of the 
transformations from SU n are equal to 1. The characters for the irreducible 
representations of SU n are obtained from (8.24), where we note that ,£ v = 
e isv<* v = 1. According to (8.24) we find that two characters with index sets 
h u h 2 , ■ ■ ., h n and h \,..., h' n are equal if there exists a number / for which 
h[ = h t + l. For SU n we can, for example, set h„ = 0 and we need only use 
all index sets h t > h 2 > ■ ■ ■ > h n — 0 in order to obtain all irreducible 
characters. 

We will now compute the degree with which the representation of U n 
occurs in // v \ Since this degree is the same for S U n , all representations for 
which h'i = h t + l have the same degree as that with index set hi. The degree 
N (v) of the representation is 

jV ( v) = x M (l, 1,..., 1). 

From the formula (8.24) it follows that (in a formal sense) we would obtain 
0/0 for iV <v) . Therefore we shall now consider the special case ot v = (n — v)co. 
Here we obtain 




gihi(n- l)io gihi(n-2)to ... j 

g(/i 2 (n- 1)<» e ih 2 (n-2)a> ___ j 

gih„(n- l)ea giA„(n-2)ra _ j 

= n (e ih “° - eih n ~ n [(*( - h k )icol 

i <k i<k 


where the last expression gives the smallest power in to. In the same way we 
obtain 


A = fl (e i( "~ J)a - e i(n - k><0 ) ~ n ([(« - j) ~ (n- k)]ico). 

j<k j<k 

Thus in the limit co -* 0 we obtain: 


iV (v > = 


n ( h ‘ - 

i<k 


n ( k - o 


A (h u h 2 ,...,h„) 

A(n — 1, n — 2,..., 0) 
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with 


A(a 1; a 2 , ■ ■ ■, a n ) 


ar 1 a"r 2 
ar 1 ar 1 


a t 1 
a 2 1 

a„ 1 


Since the X (v) are known, from (8.9) we may compute the x M (P) — 
y M (y i, y 2 From (8.24) we obtain: 

Zrci >12■ ■ ■ nhiy 1 ’ 72, • • ^Phi’-hn ^^yi^yj * * • ’ (8.26) 

<») 

where, instead of (v) we use the row lengths n l n 2 ■ ■ ■ n k of the corresponding 
tableau. From (8.26) it follows that x„ 1 „ 2 ...„ fe (y 1 , y 2 , ...) is the coefficient of 
ei‘4 2 • • • 4" i n the expansion of A s yi s y2 • • • in powers of £, ■ • • e„. The degree g 
of the representation corresponding to the tableau • • • n k is obtained by 
substituting y 1 = y 2 = • • • = y f = 1 in x„ in2 ... nk (y u y 2 ■ ■ •), and for this case 
we find that s yi s y2 ■■■ = s{ =.(£"=, s v ) f . Therefore we find that g is the 
coefficient of e^ 2 • • • 4" i n the expansion of 


i svYzc- 

*= i / p 


-\) p Pe\-hr 2 


in powers of the e l ■ ■ ■ e n . (8.27) is, however, equal to 

/! 


Y(-i) p y 

p v^vJval-'-vJ 


el'ef • • • e^Ps", 


n- 1 


• £„• 


(8.27) 


If Pe" 1 • • • e° = ei‘4 2 • • • 4"> we therefore obtain 


flf=/lE(-l) p [(fci - fci)! (K - * 2 )! •••(*„- U!]“ 1 

p 





1 



1 



1 



IK - 

- (n - 

1)]! 

IK ~ 

- (n - 

2)]! 


M 




1 



1 



1 


= /! 

IK - 

- (« - 

1)]! 

IK - 

■ (n - 

2)]! 






1 



1 



1 



IK- 

- (n - 

1)]! 

[h„- 

■ (n - 

2)]! 


h„! 



f 








= A 

lh 2 \- 

•A-' 







D- 

••(hi 

— n 

+ 2) 

hi (hi 

- 1) • 

••(hi 

- n + 

3) 


1)' 

■(K 

— n 

+ 2) 

K(K 

- 1) • 

' ’ (h„ 

- n + 

3) 
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By selective subtraction of the columns, for the determinant we obtain 


hr 1 K ~ 2 ••• k i 

h n ~ 1 h n ~ 2 ■■■ h 1 

ri n fi n n n i 

Therefore we obtain: 

f'U(hi-h k ) 

i<k _ 

9 hy \ h 2 \ • ■ ■ h„\ • 


(8.28) 


For the group SU 2 which is the representation of the rotation group in spin 
space we may set h l > h 2 — 0 in p hlh2 (e u e 2 ), as shown above for the general 
case of SU„. If we introduce the quantity S by h t = 2S + 1 then S can take on 
the values 0, j, Thus, setting e, = x and s 2 = 1/x we obtain 

P 2 S+ i,o( e i> £ 2 ) = * 2S+1 ~ x~ (2S+1) 


Thus, from (8.24) it follows that (using S instead of v) 

2S+1 _ —(2S+1) 

X (S \s u e 2 ) = -“—r-= x 2s + X 2S “ 2 + • • • x- 2S . 

X — X 


If we use SU 2 as a representation of the rotation group, then we obtain 
x = e i,l>n where cp is the rotation angle and find that 

X (S \e u e 2 ) = e iS<p + e i(S ~ ^ + ■ ■ ■ + e~ iS<f . 


Thus we have again obtained the well-known character of the representa¬ 
tion 2) s of the rotation group. The degree of the representation is therefore 
given by 

N (S) = ( x< s % =0 = 25 + 1. 

For the corresponding character Xs(P) of the representation of S f in i s (t is 
the spin space), it follows from (8.9) that 

I Xs(P)(x 2S + x 2S ~ 2 + • • • + x- 2S ) = (x 511 + x~ n )(x n + x~ n ) ■■■. (8.29) 


Since in t f S < 2 f it follows that S = 2f — k where k is an integer > 0. By 
multiplication with x f {\ — x 2 ) it follows that for z — x 2 : 

I Xs(P)(z k -z /+1 - k ) = (l - 2)(1 + z«X 1 + z») 
s 

Since k < //2 it follows that Xs(P) i s the coefficient of z k on the right side of 
this equation. In particular, the degree g of the representation is the co¬ 
efficient of z k in (1 — z)( 1 + z) f , that is, 


9 = 



/K/-2/C + 1) 

k\(f~k + 1)! 


/A /-2/C + 1 
\kj f — k + 1 


(8.30) 
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The characters for the associated representations are given by x's = ( — 1) p jcs- 
The group SU 3 plays an important role in elementary particle physics; 
we shall therefore discuss the group SU 3 as an additional example. We will 
not, however, discuss its physical meaning for elementary particle physics. 

At this point we could evaluate the formula (8.24) subject to the additional 
condition h n = 0 for the case of SU 3 (see the above discussion for the general 
case of SU n ). Actually, it is preferable to make use of the so-called branching 
theorems, which we shall now state and prove. 

Branching Theorem for the Permutation Group 

The irreducible representation of S r corresponding to a tableau with row 
lengths n l5 n 2 ,.. ■, n k is decomposed into irreducible representations with 
respect to the subgroup Sy_ t corresponding to tables with row lengths 

«i - 1 n 2 ■■■ n k , 
n 1 n 2 - 1 • • • n k , 

tii n 2 n k — 1, 

where we omit those tables for which the line lengths are not decreasing. 
Each such irreducible representation of S f - t occurs exactly once. 

Proof. Let P be a permutation of Sy _ lt that is, a permutation of S / which leaves the 
last number / fixed. As a permutation of S , P contains a cycle of length 1 more than it 
does as an element of Sy_ From (8.26) we have concluded that in the expansion 

As r. s r2--- = ei 1 •••<£" 

the coefficient a hl ... hn is equal to x„, ... nit (y 1 , y 2 , • • •)• If we consider P to be an element 
ofS r _„ then with the cycles y t , y 2 ,... of P (as an element of S/_ j) we obtain: 

As r. s r 2 ''' = I Xny„ k (P)e1\ ■' • £"■ (8.31) 

Here x ni ...„ k (P) is the character of P in the irreducible representation of S f - s corres¬ 
ponding to «! • • • n k . 

On the other hand, if we consider P as an element of S f , since Sj = £" = i £ v (that 
is, s y for a cycle length y = 1), we obtain: 

A ( E • • • = - (8-32) 

Here y„ i ...„ fi (P) is the character of P in the irreducible representation of S f corres¬ 
ponding to n[ ■ ■ ■ n' k . 

If we multiply (8.31) by t e. v and then compare the result to (8.32) using (8.25) 
it follows that y„. ... nk (P) is the sum of the as given by the branching rule. 

Branching Theorem for U n 

If we restrict U n to a subgroup [/„_ , of a (n - l)-dimensional subspace of 
t we find that an irreducible representation of U„ corresponding to a tableau 
with row lengths n u n 2 ,...,n k (or to the values h 1; h 2 ,..., h k in (8.25)) 
is decomposed with respect to l/„_ x into irreducible representations 
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corresponding to tableaux with row lengths n\, n' 2 ,.. ■, n ' k ^, (or to the 
values h\,h' 2 ,, h' k ) where 

«i > n\ > n 2 > ri 2 > ■ ■ ■ > n ' k _ t > n k (8.33) 

or 

h l > h\ > h 2 > h 2 > h 3 > ■ ■ ■ > h ' n _ t > h n , (8.34) 

respectively. Each of the components corresponding to (8.33) (or to (8.34)) 
occurs exactly once. 

Proof. We obtain the mappings of t/„_ j if we map, for example, q>„ into <p„, that is, 
set c„ = 1. According to (8.24) and (8.17) we need to compute p hl .. . kn (a . u ..., 0) (that is, 
set a„ = 0): 


Pc, i,...,0) 




1 


£l‘ 

£ 2 ‘ • 

• • 1 

e ih 2 *i 

gi*2<*2 

1 

= 

E l 2 

«2 2 • 

• • 1 

e ihna 1 

gihri«2 

1 


P hn 

4" • 

• • 1 

0*1 _ 

£ 1 

e 1 ? £ 2 ‘ ~ 

£ 2 ‘ 


0 



0*2 

£ 1 — 

o*3 o*2 _ 

£ 1 fc 2 

£ 2 3 


0 




4" 



1 



s i' - 

- 4 2 (4 1 

— £ J 

2 


«5‘-l 

0*2 

b n- 

0*2 

£ 1 

- £*! 3 £ 2 2 

_ o*3 

£ 2 


0*2 

£ n — 1 

0*3 

fc n- 

o*n- 1 
£ 1 

— £;” 

1 _ 



❖- 1 1 

— £* n 
b n- 


and similarly 


A( El ,...,l): 


r.n—2 r.n-2 


er 1 


P n—2 
£ 2 


l l 


c n-2 c n -3 

£ 1 £ 1 

rJi—2 r.n-3 


c n-2 c n -3 
£ n-1 £ n-l 


«i i 
£ 2 1 

1 1 


(Si - l)(e 2 - 1)•••(£„-! - 1) 


= A(£„,..., «„-!)(£! - 1)(£ 2 - !)•••(£„-.! - 1) 

and we then may compute A - 1 p hv .. hn by using (8.24). 

Using the formula; 


£ — 1 


£ h 1 + ■ ■■ + £ h ' 
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from (8.24) we obtain 




phj 1 

£ 1 

+ 

• • + si 2 

e h 2 l 1 + • • 

• + £* 2 

ph 2 — 1 
S 1 

+ • 

■ ■ + e* 3 4 

el 2 ' 1 + • 

• • + e 2 3 

*hn - 1 — 1 
'1 

+ • 

■ ■ + e*r 

+ • 

• • + s 2 " 


The last determinant is, however, the sum of all determinants of the form 

ej 1 ' 

£ i' 2 e 2‘ 

S K,-, ... 

where h\, hi 2 ,... are all the values for which (8.34) is satisfied. From (8.25) we find 
that (8.34) is equivalent to (8.33). 


Branching Theorem for SU„ 

From the branching theorem for U n we immediately obtain a corresponding 
branching theorem for SU„, except that only those representations described 
by h\, h ' 2 ,... for which the h[ differ by a fixed integer are equivalent, as we 
have proven earlier. 

We will now apply this theorem to SU 3 ! An irreducible representation of 
SU 3 is characterized by two numbers l u l 2 , because we can set /t 3 = 0, as we 
have shown earlier. According to (8.25) 

hi = «! + 2, h 2 = n 2 + 1, h 3 = n 3 

here we can set h 3 = n 3 = 0. 

A representation corresponding to the numbers 


hi > h 2 > h 3 = 0 


is decomposed, as a representation of SU 2 into those representations h\, h' 2 
where 


h 1 > h\ >h 2 > h' 2 > 0. 

All representations with the same difference h\ — h' 2 = 2S + 1 are equivalent 
to the representation h\ = 2S + 1, h' 2 = 0, that is, to the representation of 
SU 2 with “spin quantum number” S. Thus it follows that the representation 
of SU 3 characterized by h,, h 2 , as a representation of SU 2 , decomposes into 
representations corresponding to spin quantum numbers S where 1 > 2S + 
1 < hj — 1, where the representation for the spin quantum number S has the 
following multiplicities: 

(1) 2S + 1 > h 2 and 2S + 1 > h i - h 2 : h, - (2 S + 1). 

(2) 2 S + 1 > h 2 and 2S T 1 ^ h j — h 2 : h 2 . 

(3) 2S + 1 < h 2 and 2S + 1 > hi — h 2 : (h t — h 2 ). 

(4) 2S + 1 < h 2 and 2S + 1 < h l - h 2 : (2S + 1). 
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Since n t = h 1 — 2, n 2 = h 2 — 1 and n 3 = 0 the irreducible representation 
of SU 3 in r f characterized by h l , h 2 occurs in r f with a multiplicity of 
/= hy +h 2 - 3. 

For the case in which h x = 3, h 2 = 1 we find that / = 1, that is, SU 3 is 
represented by itself in t. As a representation of the subgroup SU 2 it is 
decomposed in the representations for S = \ and S = 0 which occurs once. 

For the case in which hy — 3, h 2 = 2 we find that/ = 2. The representation 
occurs as an irreducible part in i x t. It decomposes as a representation of 
SU 2 in both representations for S = 0, j, exactly once each. This representa¬ 
tion is therefore 1+2 = 3 dimensional. 

For h x = 4, h 2 = 1 it follows that / = 2, and the decomposition is into 
S = 0, 1 (each simply). The representation is therefore 1 +2 + 3 = 6 

dimensional. 

For h y =4,h 2 = 2 it follows that/ = 3. It decomposes into S = 0 (simple), 
S = j (double) and S = 1 (simple). The order of the representation is therefore 
1 + 2 • 2 + 3 = 8. 

We will now briefly state a few additional cases. Here d will represent the 
order of the representation. 

hy = 4, h 2 = 3, / = 4; S = 0 (simple), S = \ (simple), S = 1 
(simple); d=l+2 + 3 = 6. 

fcj = 5, h 2 = 1, / = 3; S = 0, j, 1, | (all simple); d = l + 2 + 3 + 

4 = 10. 

h y = 5, h 2 = 2, / = 4; S = 0 (simple), S = \ (double), S — 1 

(double), S = | (simple); d=l + 2- 2 + 2- 3 + 4=15. 

hy — 5, h 2 — 3, / = 5; S = 0 (simple), S = j (double), S — 1 

(double), S = f (simple); d = 15. 

hi = 4, h 2 = 4, / = 6; S = 0, 4 1, § (all simple); d = 10. 


9 Perturbation Computations 

In order to make qualitative statements concerning the position of the parent 
terms (the multiplet energy levels) it is necessary, in a first approximation, to 
use perturbation theory to compute the energy changes for the general case 
in the same way as we have used in the case of the helium atom. We will 
assume that the Hamiltonian operator is given in the following form. 

H = £ H k + £ V ik , (9.1) 

k= 1 i<k 

where the H k act only on the coordinates of the fcth electron. Here V ik repre¬ 
sents the interaction between the ith and kth electron. V ik need not take into 
account the entire Coulomb interaction; a part of this interaction, the 
shielding of the Coulomb field of the nucleus may be taken into account by 


Putie. Pfcy 4iai 
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the H k . We note that / need not be the total electron number of the atom 
because in the vicinity of the nucleus the closed electron shells contribute 
primarily to the shielding. 

The initial configuration e = s 1 + e 2 + • • • + e„ may consist of in¬ 
equivalent nondegenerate terms e v . The eigenvectors of H k corresponding to 
the eigenvalue a k may be <p vv The eigenvalue problem in zeroth approxi¬ 
mation is expressed in terms of the space spanned by the P<p Vl ( 1) • • • <p v/ (/) 
as follows: 

<e<M i) • • • h z x{p)Pcp vv { i) • • • <p v/ c/)> 

r (9 2 ) 

= e<Qcp v ,( 1) • • • <p v/ (f), I x(P)Pcp v ( 1) • • • <p v ,(/)>, 

p 

where Q and P are permutations. Since H commutes with Q, from the fact that 
the <p Vi are mutually orthogonal and normalized, it follows that 

I x(T)<<M 1) • • • I v *Qr • • • <M/)> = ^(<2), (9.3) 

P i<k 

where ri — e — e° = e — ]T k e k . 

If we set 

a(P) = <<p Vl ( 1) • • • <p v/ (f ), Z V ik P- > V1 (1) • • • <P v/ (/)> (9.4) 

i<k 

then the equation can be written in the form 

xa = r\x (9.5) 

using the elements 

a = Z<*(P)P and x = Z x ( p ) p (9-6) 

p p 

from the ring of groups The solution of (9.5) can be greatly simplified 
by the reduction of the group ring. We need only solve (9.5) in one of the 
repective equivalent right ideals. If x t is a solution corresponding to the eigen¬ 
value r] 1 which corresponds to the idempotent element e (v> then the vector 

<P = *l<Pv,(l) ' •' <Pv f (f) 

is the corresponding solution of (9.2). Since x, = e (v) x u we obtain e <v) </> = <f>. 
Therefore $ is the desired solution for the subspace / <v) corresponding to 
e <v) in (7.2). We need only solve (9.5) for those idempotent elements e (v) 
(where e (v) x = x) for which the corresponding tableau has at most two 
columns, otherwise the Pauli exclusion principle cannot be satisfied using 
spin functions. 

In our case (e denotes the unit element, ( ik ) denotes the exchange of the ith 
and fcth electrons) we obtain 

a = I <P<MD •'' <M/)’ I ^v,(D ''' <Pv f (f)> p 

P i<k 

= e Z c * + Z (ikM*> ( 9 - 7 ) 

i<k i<k 
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where 


Cu = <<Pv,0>v k (fc), V ik (p Vi (i)(p Vk (k)), ^ ^ 

An = <9v k (i)(p vi (k), V ik (p Vi (i)cp Vk (k)}. 

We may formulate the problem (9.2) in such a way as to include the spin 
vectors and use the Slater determinant. Then the desired eigenvector of 
zeroth approximation will have the following form: 

X *(«!, •••,«/) E (“ l) Pp C < Pv,( 1 )''' ‘Pv/ZK.O)' • • «*//)] (9.9) 

ai •••a/ P 

or, what is the same, 

X (- l) p [P<p Vl (l) • • • q> Vf (f)-]PU, (9.10) 

P 

where 


u = x x(«! • • • o^x.a) • • • u Xf (f). 

Here U is a vector from the spin space of the/ electrons. Here it is clear that, 
on the basis of the Pauli exclusion principle, we may transfer the eigenvalue 
problem to the spin space. According to (9.2) the eigenvalue problem in the 
zeroth approximation is described by the equation 

K-i) p (e<Mi)---<x(/)> E v ik p (Pvi (i)-- ( p Vf (f))pu 

P i<k 

= *?£(-if^v/D• • • <M/)>’i)• • • <P Vf (f)>PU. (9.il) 

p 

We may set Q = e; the other permutations do not lead to any new problems. 
Thus, for 

a'(P) = (- l) p <<p Vl (l) • • • cp Vf (f), X V ik Pcp Vl ( 1) • • • <p V/ (/)> (9.12) 

i<k 

and for a! = YjP a'(P)P in spin space we need to solve the eigenvalue equation 

a'U = tj U. (9.13) 

The solution of the two equivalent problems (9.5) and (9.13) can be solved 
with the aid of the characters of the permutation group as follows: 

According to the general derivations in AV, §7 the trace of the right trans¬ 
formation (9.5) is equal to that of the left transformation ax. If we restrict our 
considerations in (9.5) to the subspace given by a single e (v \ then we obtain 
the following trace relationships for the sum of the eigenvalues ri i : 

X a(P)x (v) (P) = £ (9.14) 

P i= 1 

where r is the dimension of the (v) representation. If we then use the equation 
xa m = r] m x, then from 

a m = (X a(P)pJ = X a m (P)P (9.15) 
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it follows that 


Xa m (P) Z (v, (P)= X r,T. (9.16) 

p >= i 

From these equations it is, in principle, possible to determine the rj ;. 

The problem (9.13) can be treated in a similar manner. The spin space can 
be decomposed into products iV s x 'W Mv) (see (1.3c)) for the different values 
of S which belong to different representations A (v> of S f , the characters of 
which we shall denote by Xs( p ) (see §8). For a given value of S' from (9.13) it 
follows that: 


X a'(P)Xs(P) = X* (9.17) 

P i= 1 

and, more generally: 

Y J a m (P)xs(P)= i nf where a' m = X <( p ) p - (9-18) 

p i=i p 

Since the representations of S f in orbital space are associated with those in 
spin space, it is easy to see that both problems lead to the same result. 

The problem (9.13) can be treated in an elementary way with the aid of the 
spin invariants (7.16). The spin invariants characterize all vectors which 
transform in the same way under permutation, that is, an entire column of 
the matrix (1,3a). If the representation of S f corresponding to S has dimension 
g then there are g linearly independent spin invariants A 1 ,...,A g . We may 
then set U — X?=i x v A v ; we then obtain the problem 

^/(P)Pd v = ^U. (9.19) 

P,V V 

We may then compute, in an elementary way (see, for example, XV, §7) 

p A v = X 4,2V 

We then have the following problem to solve: 

X a'WvXv = 7 V 

p 

If, for a', we use the special form (obtained from (9.12) using the same pro¬ 
cedures used in the derivation of (9.7)) 

a' = e X C ik - X dk)A ik (9.20) 

i<k i<k 

then in spin space we may transform the problem (9.13) into the following 
remarkable form: 

Let S t and S k be the spin angular momentum operators for the ith and fcth 
electrons, then if we choose the basis such that (S ; + S k ) 2 and (S; + S k ) 3 are 
in diagonal form, then S t ■ S k must be in diagonal form. According to XII, §4 
there are two cases: The vector in spin space is antisymmetric in i and k and 





174 XIV Spectra of Many-Electron Systems 

(Si + S k ) 2 has the eigenvalue 0 or the vectors in^spinspace are symmetric 
and (Si + S k ) 2 has the eigenvalue^. From (S ; + S k ) 2 = Sf + S ; • S k + S k it 
follows that, in the first case, 2S ; • S k has the eigenvalue —§ and, in the second 
case, 2 — § = j. Thus \ + 2S t ■ S k has the possible eigenvalues — 1 and 1. Thus 
we have shown that 


(ik) = | + 2Si-S k . 


Thus, from (9.13) we obtain 

e Z (C ik - jA ik ) -2 ZA ik Si- sJt/ = rjU. (9.21) 

_ i<k i<k J 

The Coulomb interaction of the electrons is such that they behave as if their 
spin is elastically coupled with the potential energy —2 Y*i<kA ik Si-S k . 
If the “exchange integral” A ik is positive, as we would expect for the correla¬ 
tion energy (XII, §3), then the parallel orientation of the spin would lead to 
lower energy levels. Classically this fact has the following intuitive explana¬ 
tion : a change in the state from the parallel position requires work against 
the Coulomb(!) interaction, because, on the basis of the Pauli exclusion 
principle, a change of the spin is automatically affected by a change in the 
correlation in the position probability. If the spin is parallel, then both 
electrons “avoid each other’s company”; if antiparallel they “prefer each 
other’s company.” 

At the beginning we assumed that the initial configuration consists of 
nonequivalent nondegenerate energy levels. We shall now present an outline 
of how computations are to be made in the event that the above condition is 
not satisfied. 

Let the electron configuration be given by (n 1 l 1 )(n 2 l 2 ) •••(«///)• Suppose 
that the groups of equivalent electrons contain k u k 2 ,... electrons, so that 
k x + k 2 + •••=/. Let a denote the subgroups of S f which permutes only 
equivalent electrons. Two index sets m u m 2 , ..., m f and m\, m' 2 , ..., m' f are 
said to be congruent with respect to a if there exists a permutation in a which 
permutes one into the other. We can then classify index sets m 1 ,m 2 ,... ,m f 
into congruence classes, and choose a representative for each of these. For 
the perturbation computation we shall now consider the following eigen¬ 
functions : 

Z'Z x ( m i m 2-" w /> a i •••<*/) 

m a 

• z (-1 w/Kd) • • • m/)] 

p 

= Z' Z (- l) f fk lllBl (l) • • • cp nflfmf (f)-]PU mim2 ... mf , (9.22) 

m P 

where 


= Z x ( m i> m 2 ,...m f ,oc ! • • • <x f )u ai (l) ■ ■ ■ u Xf (f). 

a 
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Here means that the sum is to take place only over the representative 
index sets m x - ■ - m f . Congruent index sets will lead to linearly dependent 
functions (see §3). Let V = Y,t<k V ik ; the eigenvalue problem then can be 
written as follows: 

I' I(- VPcp nihmi ( 1) • • • 

m' P 

S zij ( 0 ■ ■ ' Vnflftnf{f\ P(Pnilimi(X) ' ' ' 

m' P 

(9.23) 

Since, for m x ■ ■ ■ m f and m\ ■ ■ ■ m’ f , only one representative from each con¬ 
gruence class is present, we find that 

<<2W imi 0) • • • <Pn f l f m f U), PfPnthm iO) ' ‘ ' (Pn f l f m' f (f)> = 1 (9-24) 

ifandonlyifm'i ■ ■ ■ m' f = m l ■ ■ ■ m f and P is a permutation from a which leaves 
the index set m l - ■ -m f invariant. We shall denote such permutations by P m . 
Thus the right side of (9.23) is equal to 

n I (-1 Y-P m U mi ... mf = (9.25) 

Pn, 

We shall denote the subgroup formed by the elements P m by a m . 

Since 

PQPm'(Pnilim\(^') * * ’ 

= <<Pn 1 i 1 m 1 (l) • • • <P« f l f m,(S), VQ<p„ lhmi (l) ' ' ‘ <Pn,l f m' f (f )> (9.26) 
we can write the left side of (9.23) as follows: 

rir S(—i) s (- i ) Pm ' 

m Q P m ' 

* ' * * Priflfmf(f\ ^Q^Pnihm'iiX) * * ' 

QPm’U mi - m> 

= I'm x ■ • • m f , m\ ■ • • rn' f )QW m ,... m . f , (9.27) 

m' Q 

where the sum ) takes place only over the permutations Q of each left 
coset of <x m . and where 

a(Q ; m 1 - ■ ■ m f , m\ ■ ■ ■ m ' f ) 

= (- l) G <<P Ml l,m,(l) • • • (Pn f l f m f (f), • • • <jO B/I/m> (/)>. 

Using the following notation 

= Z <m V2; m l ' ' ' m /> W 'l ' ‘ ' ™' f )Q 

Q 

we may rewrite the eigenvalue equation (9.23) as follows: 

^mi ••-m/, mi •••»!/^m'i ••-m/ 
m' 


(9.28) 

(9.29) 

(9.30) 
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If we obtain a rj and W m - r .. m ' f from the above equation, then from (9.22) we 
obtain the entire eigenfunction in zeroth approximation as follows: 

r z (- i) • • • 

m P 

= Z' I <m) (- 

m Q 

We may reduce equation (9.30) by considering its application to a subspace 
of spin space corresponding to the spin quantum number S. We may make 
the transition from the m 1 ■ ■ ■ m f to the different quantum numbers L, m L 
of the total orbital angular momentum, where the latter are obtained from 
the m 1 ■ ■ ■ m f in the manner described in §3 used to obtain the various energy 
levels. For the chosen value of S we may, on the basis of the discussions in 
§§3 and 7, obtain energy levels with L values L u L 2 , ■ ■ ■ where the terms L X S 
and L 2 S ■■■ may occur with multiplicities and A 2 ..., respectively. Then, 
from (9.30) it follows that, with the eigenvalues rfjfi (v = 1,2,..., A ; ), by taking 
the trace, we obtain 

Z' T! m) a(Q;m 1 ---m f ,m 1 ---m f )x s (Q ) 

m Q 

/with j £ wt.l < Afj 


= I (2L £ + l)f|£? + (2M + 1) X (9.31) 


L„ v 

(with L, ^ M) 


L„ v 

(with L, > M ) 


If the Aj = 1, then we have already obtained the eigenvalue tj Li ; otherwise we 
only obtain the sum £v=i rj^]. Then we may use the iterated equation of 
(9.30) 


Y' a (k) , , W / , = j/IT 

Zj ■••!«/,mi •••m/ rr mi •••m/ 'I 


(9.32) 


where 


7 (k) 

• • • mf , mi • • - my 


, ,=V'a „ , , 

i•••my Zj w mi •••m/,mi ••-m/ M mi • ■-m/, mi • •-m/ 


from which we can obtain relationships similar to (9.31) for the /cth power of 
the eigenvalues 


Tita^fccjwi iiteaZ Pfc ylt e J. 



CHAPTER XV 


Molecular Spectra and 
the Chemical Bond 


1 The Hamiltonian Operator for a Molecule 

Let r k and p k (using Latin indices) denote the positions and momenta of the 
nuclei and let r K and p K (using Greek indices) denote the positions and 
momenta of the electrons, respectively. The Hamiltonian operator for the 
nuclei and electrons is then given by 

H ^ Z — pl + Z pl + U(r k , r K ) + H s , (1.1) 

z k=l m k M k=1 


where m k are the masses of the nuclei, m is the mass of an electron and H s are 
the terms involving the spin of the electrons. U is the potential energy 


U(r k , r K ) = x Z 

Z kl 


1 „ Z k Z t e 2 , 1 


e 


k*l 


+ T Z 

^ K x Y kX 


z—> 

kX r kX 


where Z k e is the charge of the kth nucleus. Let p k = (l/Ograd* and p K = 
(l/i)grad K . Then we may rewrite (1.1) as follows: 


H = 


1 V 1 A 

- ) — A t 

2 k"i k 


2m 


Z A k + U + H s . 


( 1 . 2 ) 

177 
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In order to separate the center of mass motion we shall now introduce new 
coordinates (see also VIII, §§1 and 2) as follows: 


h f h 

MR = £ m k r h + m £ r K ; M = £ m k + fm; 


k= 1 

K = r k - r 1? 


K= 1 


(1.3) 


i- = r„ 


R, 


where k = 2, 3,..., h and r } is the position of the first nucleus. 
We therefore find that (1.2) is transformed into 

1 

" A_ 2' 2M 


H = 

2 M 


1 grad^ 


K= 1 




(1.4) 


where A acts upon the coordinates of R, grad; and A' k upon r' k , and A;, grad; 
upon respectively. Since U and H s do not depend upon the absolute 
position of the particles, that is, they are invariant under translation^ of all 
particles, in the new coordinates (1.3) they must be independent of R. The 
only term in (1.1) which depends upon R represents the kinetic energy of 
the center of mass. We may neglect this term since we are only interested in 
the internal energy of the molecule (because we are interested in the spectra 
and the binding energy). Therefore we shall set 

»=- sradT 


k= 2 2m k 


2m x 


k= 2 




(1.5) 


The first two terms represent the kinetic energy of the nuclei; the second term 
is the kinetic energy of the first atomic nucleus, the momentum of which (in 
center of mass coordinates) is equal to the negative sum of the momenta of 
the other nuclei. The following term in (1.4) 



which represents the effect of the electron motion upon the nucleus should 
be taken into account. However, since the mass of the electron is small 
compared to that of a nucleus, we find that this term is small compared to 
the third term in (1.5), we shall neglect this term as in the case of atoms. 


2 The Form of the Eigenfunctions 

Here, as in the case of the atomic spectra, we shall for the present neglect the 
term H s in (1.5). We shall investigate the effect of this term later in §9. The 
first two terms in (1.5) contain the masses of the nuclei in the denominator. 
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Since the latter are more than a thousand times the mass of the electron, the 
corresponding kinetic energy of the nuclei will be small compared to that of 
the electrons (see §8). This fact follows directly from the Heisenberg un¬ 
certainty relations The localization of electrons and nuclei in the molecule 
to a volume of radius d will result in momenta of the order of p ~ 1/d. For 
the nuclei we will obtain a smaller kinetic energy than we will obtain for the 
electrons. If we neglect these two terms, then we obtain the following 
Hamiltonian operator: 

H e = - ~ I K + U. (2.1) 

2 

Here H e is to be applied to functions of r k and r' K . The eigenfunctions may, 
however, be written in the form 5(r k — r k )q>(r' k , r' K ) where 

HMh, K) = E(? k )<p(? k , ?') (2.2) 

which, intuitively, means that the nuclei are fixed at locations r k and are 
therefore fixed centers of force for the electrons. E(r k ) is the energy of the 
electrons and is dependent on the positions of the nuclei (including the mutual 
potential energy of the nuclei). We may suppose that we would obtain a very 
good approximation for the energy eigenvalues for (1.5), that is, for 

" - - i ■ 41 - i it + "■ - H ‘ +H - <Z3) 

if we substitute E(r k ) (obtained from (2.2)) (here, for simplicity, we replace 
r k by r k ) and use the eigenvalues of the equation 

{Hk + E(r k )}iKr k ) = ap<?& (2.4) 

Thus, we assume that, because of the large masses of the nuclei, the electrons 
move as if the nuclei were at rest, and that the nuclei move under the influence 
of the position-dependent energy E(r k ). Mathematically this means we 
approximate the eigenfunctions of (2.3) by the form: 

(p = Mridviri r K ), (2.5) 

where (p satisfies (2.2), and assume that the dependence of (p upon r k is much 
weaker than that of ip, so that, in the differentiation of the function cp with 
respect to r k we may assume, in first approximation, that (p is constant. We 
will later justify this assertion. 

The approach (2.5) is reasonable only if the eigenvalue E(r k ) is non¬ 
degenerate [as is in general the case of molecules having more than two atoms 
because degeneracy shall occur only as a result of spatial symmetry of HJ. For 
the case of two nuclei there is a rotational symmetry. Here we find it necessary 
to replace (2.5) by linear combinations of the form (2.5) with different cp 
corresponding to the same value of E(r k ) —see §8. 

For more than two atoms we obtain a better approximation for e then (2.4) 
if we use the form (2.5) where we obtain <p from (2.2), and we determine the 
function ip by means of the Ritz variational principle (VIII, §3). 
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Then, instead of (2.4) we obtain 

{H k + E(r K )}ip = eij,, (2.6) 

where 


E(r ' k ) = E{r k ) + J (p(r’ k , r' K )H K <p(r k , r' K ){dr^) f , (2.7) 

where ( dr' K ) f is integrated over all of the electron coordinates. Note that (2.7) 
is a better approximation than (2.4) with respect to the dependence of (p upon 
the coordinates of the nuclei. 

Equation (2.6) has, in addition to a continuous spectrum, discrete eigen¬ 
values for the lowest energy eigenvalues only if E(r' k )~ or, to a good approxi¬ 
mation £(r[)--has an absolute minimum; only then will we have bound states. 
If the lowest energy eigenvalue of (2.2) has this property, then the ground 
state is bound, that is, we have a chemically bound molecule. In the following 
section we shall consider the eigenvalues of (2.2) and their dependence upon 
the positions of the nuclei. The minimum for the ground state of (2.2) yields 
the equilibrium position of the nuclei, that is, the chemical structure of the 
molecule. 


3 The Ionized Hydrogen Molecule 

The simplest example is the ionized hydrogen molecule in which an electron 
moves in the field of two nuclei. Equation (2.2) takes on the following form 

A-+ - W) = Ecp{7), (3.1) 

\ AYYl V j Y 2 / 

where r is the position vector of the electron, r x and r 2 are the distances from 
the electron to the nuclei 1 and 2, respectively, and a is the separation between 
the two nuclei. Note that cp and E depend upon a. E = E — e 2 ja is the energy 
of the electron. 

Equation (3.1) can be separated and solved by introducing elliptical 
coordinates (see [14]). We will now seek to obtain, using symmetry con¬ 
siderations, a qualitative description of the dependence of the energy E’ 
upon the distance a. All problems with two nuclei are invariant under 
rotations about the line joining the nuclei and under reflections in planes 
containing the above line. These symmetries form a group which we shall 
denote by d. If we choose the line joining the nuclei as the 3-axis we can 
describe a special reflection as follows: 

A = 

s 2 : A = -* 2 , (3-2) 

x ' 3 = x 3 . 
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If D a is a rotation about the 3-axis by the angle a then it easily follows that 
(e = unit element of the group): 

s 2 D« = £>-*s 2 , (s 2 ) 2 = e. (3.3) 

Each element of d can be described either in the form D x or s 2 D a . 

The representations of d are easy to find. The rotations D a form an abelian 
subgroup of d, the irreducible representations of which are one-dimensional 
and are already familiar to us as representations of a subgroup of the entire 
spatial rotation group (VII, §3): 

D x u m = e~ inux u m (m = 0, ±i ± 1, ±f,...). (3.4) 


According to (VII, §3), in the space :d h we need only consider the values 

m — 0 , 1 , 2 ,.... 

Therefore we may choose basis vectors of the form (3.4) for any (not only 
for the irreducible case) representation space 01 of d. Since, according to 
(3.3) we obtain 

D x (s 2 u m ) = s 2 D_ a w m = e ima (s 2 u m ) 


the two vectors u m (with m > 0) and s 2 u m span an invariant two-dimensional 
subspace (with respect to d) of 3k. This subspace is, by construction, clearly 
irreducible. The set of all vectors v which are invariant under rotations D a 
(that is, D x v = v) forms an invariant subspace i 0 with respect to d. The basis 
vectors in can be chosen such that they are eigenvectors of s 2 . 

According to the second relation in (3.3) we therefore obtain either 
s 2 v = v or s 2 v = —v. In this manner we obtain all irreducible representa¬ 
tions of d. They can be characterized by a scalar A > 0. For A > 0 they are 
two-dimensional: 


D*v A = e iA X, 
D a v- A = e iAx V- A , 


s 2 Va = V- A 


(3.5) 


and, for A = 0 they are one-dimensional. Here it is necessary to distinguish 
the two cases 0 + : 


D x v 0 = v 0 , s 2 v 0 = v 0 

(3.6a) 

D a v 0 — V 0’ S 2 V 0 — ~ v 0 4 

(3.6b) 


We shall denote these representations by d 0 ■■■, d 0 . 

Since, in the case of the ionized hydrogen molecule (H 2 ), both nuclei have 
the same charge, there is an additional symmetry—that of reflections about 
the midpoint of the line joining the two nuclei. Let us choose this point as the 
origin of coordinates. We then obtain: 

s: x j = -x u x' 2 = — x 2 ,x' 3 = —x 3 . (3.7) 

Since s commutes with d, it is possible to choose the irreducible representa¬ 
tion of the group generated by s and d such that, for all vectors either 
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sv = v or sv = —v, that is, in (3.5) we have two cases; the first, we denote by 
A e satisfies 


SV A = V A , SV- A = V- A , 

the other, which we denote by A 0 satisfies 

sv A =-v A , SV- A = -v- A . 

Similar expressions can be found corresponding to (3.6a), (3.6b) for the cases 

o;, o:, 0~, 0~. 

For the case of Hj, <p depends only upon the coordinates of a single 
electron; thus for A = 0, (p does not depend upon the polar angle about the 
3-axis and is therefore only a function of x 3 and x /x\ + x \. From this it 
follows that s 2 (p = (p, and we may therefore rule out the case O ~. 

In order to obtain information about the qualitative behavior of E'(a) we 
shall consider the two limiting cases a ~ 0 and a~oo.Fora = 0we obtain 
the same energy levels for £'(0) we found in the case of He + and we therefore 
obtain the same structure as we found in the case of the hydrogen atom 
(Figure 21). £'(0) is therefore characterized by a principal quantum number n. 
For each n the orbital angular momentum can take on values / = 0,1,2,..., 
n — 1. To each l there are 21 + 1 linearly independent eigenvectors. If we 



Figure 21 Energy of the electron for the H 2 molecule. 
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permit a to increase, then the Hamiltonian operator is no longer invariant 
with respect to the entire rotation group, but only with respect to s and d. 
The energy levels will therefore be split. From XIII, §3 we know that under 
reflection s the eigenfunctions of orbital angular momentum / are multiplied 
by (— 1)'; they are even (e) or odd (o) depending on whether l is even or odd. 
For continuous changes in a this property cannot change. For a single 
electron we shall write X instead of A. 

The 21+1 eigenfunctions for the quantum number l fall with respect to 
s and d in the irreducible representation X = l, l — 1 0 + with (e) or (o) 
depending on whether / is even or odd. Instead of X - 0, 1, 2,... we often 
write a, n, S,. ... 

The ground state n = 1 is nondegenerate and therefore cannot be split. 
It corresponds to the energy level <x e + (Figure 21). The energy level n = 2 is 
split, since l = 0 or 1, into the energy levels rr e + , al and n 0 . Similarly, the 
energy level n = 3 is split, since / = 0, 1, 2, into energy levels a ', <x+, n a ; <r e + , 
7t e , <5 e . The quantitative behavior of the splitting for increasing a can be 
obtained by a perturbation calculation. The latter, however, will not provide 
a measure of the positions of the energy levels for intermediate distances. 

For this purpose it is necessary to consider the opposite case—a ~ oo. 
Here we find that the electron is in the vicinity of one of the nuclei, and has 
the energy levels associated with the hydrogen atom, as characterized by the 
principle quantum number n. To each n there correspond the l values 
l = 0, 1,..., n — 1. Here we note that to each value of n and / there exist 
2(2/ + 1) linearly independent eigenfunctions instead of (2/ + 1) because 
the electron can be found either around the first nucleus <p' 1)m or the second 
nucleus (p \ 2)m . Since under reflection about the nucleus, <p‘ m becomes multi¬ 
plied by (— l) 1 , it is easy to show that for reflections about the midpoint 
between the nuclei we obtain 

M(1)m = (— l)V(2)m, M(2)m = (~ l)V(l)m- 

We therefore obtain the functions 

(Pin, = (P(l)m + (-l)V(2)m, 

(j.o) 

<Pom = <P\l}m - (-l)V(2)m. 

which are either even (e) or odd (o) with respect to s. Since the <p l eA and (p[~ A 
form a representation of d, it directly follows that the eigenfunctions corres¬ 
ponding to an energy level n, / are associated with the irreducible representa¬ 
tions A e , A 0 where A = /, / — 1,..., 0; for the case A = 0 the only possible 
cases are 01 and O^. As the nuclei approach each other the corresponding 
energy levels are split into irreducible representations, as is shown in the 
right side of Figure 21. Here we may obtain a numerical estimate of the 
initial splitting by means of perturbation theory. 

How do the energy levels for a ~ 0 transform into those for a ~ oo as the 
distance a changes continuously? It is clear that the representation properties 
of the eigenfunctions cannot be changed in a continuous change in a. Thus, 
only energy levels having the same representation class (pedigree) can be 
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transformed into each other. We shall show below that, in general, energy 
levels of the same pedigree cannot overlap. 

On the basis of the above two rules it is possible to uniquely determine 
which energy levels are transformed into each other as follows: For each 
pedigree we need only begin with the lowest energy level and sequentially 
identify corresponding pairs of energy levels. In Figure 21 the behavior of 
E'(a) is illustrated using precise calculation. The most striking case is that of 
the strong decrease in the lowest a* energy level with decreasing a. If we 
consider E = E' + e 2 /a (Figure 22) we will find that there is a strong binding 
of the two nuclei because there is a pronounced minimum of E(a). 

We may now treat the many electron molecule in the same way as we have 
in the case of many electron atoms in terms of one electron moving in the 
average field generated by the other electrons (Hartree method, XII, §3). 
In this way we obtain a one-electron problem with a non-Coulomb central 
force which is analogous to the case of the alkali metals described earlier. 
Here we find that for different values of / the energy levels no longer coincide 
(See Figure 23, left- and right-hand sides). The nature of the splitting and the 
meaning of the connection lines in Figure 23 is clear from the above explana¬ 
tion for the case of two identical force centers. If the two force centers are not 



Figure 22 Total energy of the H J molecule. 
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Figure 23 Energy levels of the electron tor the case of two identical force centers. 


identical, then we obtain a different situation as illustrated in Figure 24. Here 
the reflection operator s is not a symmetry operator, and the designations 
odd and even are no longer meaningful. For the case a ~ oo we have twice 
as many energy levels as we did in the case of identical centers because the 
values E for the cases that the electrons are found near center (1) or (2) are 
different. Thus Figure 24 is understandable without additional explanation. 

In closing we will now verify the result used often above, that two energy 
levels having the same pedigree do not overlap. Let (p x (a), <p 2 , ■ ■ ■, (p„ and 
(p n+ 1 , (p n+ 2 , ■ ■■, <p m are the eigenvectors of H(a) corresponding to the eigen¬ 
values s x (a) and e 2 (a) which may intersect at a — a 0 (that is, e 1 (a 0 ) = s 2 (a 0 )). 
Here we assume that q> x , <p 2 ,..., cp„ and <p n+ u cp n+2 ,..., (p m (where m = 2n) 
are two equivalent irreducible representations of the symmetry group of 
H(a). If we change H(a ) by adding a small term kV(a) where V(a) belongs to 
the same symmetry group as does H{a), then, from XI, §§6 and 7 we obtain 
approximate values for the new eigenvalues E for small A from the secular 
equation: 


(«i(«) - E)5 ik + W ik (a) A V u (a) 

W rk (a) (e 2 (a) - E)8 rl + kVJa) 


(3.9) 


where i, k = 1, 2,..., n; r, l = n + 1,..., 2« and F si = <( p s , F<p,>. Since 
V(a) admits the same symmetry operation, the “correct linear combinations” 
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Figure 24 Energy levels of the electron for the case of two nonidentical force centers. 


can, according to XI, §7, be completely specified as follows: xcp 1 + ycp n+1 , 
x(p 2 + y(p„ +2 ,.. • > where xcp x + y(p„ +1 and x(p 2 + y(p„ +2 correspond to the 
same eigenvalue, etc. Thus we find that (3.9) can be replaced by an equivalent 
set of two-dimensional secular equations: 

£i(a) - E + XV u !(a) AV hn+ t (a) 

Wn+i.i(a) s 2 (a) - £ + AF„ +lj „ +1 (a) 

The difference in the roots E u E 2 of (3.10) are given by 


= 0. (3.10) 


Ei — E 2 — s/(si — e 2 + — AF„ +li „ +1 ) 2 + 4A 2 | V 1<n+1 \ 2 . 


Thus we find that E x — £ 2 ^ 0 whenever V 1>n+ ; / 0. The terms E l and £ 2 
cannot, therefore, coincide, as we have assumed (see Figure 25a). If the 
(p u ...,(p n and <p„ +x ,... ,cp m belong to different irreducible representations, 
then all nondiagonal elements satisfy V ik = 0, then £, — £ 2 = — e 2 + 

AFi.i — XV n + 1>n+ Both terms will therefore be somewhat displaced (by 
hVi'i or AF„ +1>n+1 ) and may intersect (see Figure 25b). Since a small per¬ 
turbation which does not affect the symmetry of the system gives rise to a 
separation of two terms of the same pedigree but keeps two terms of different 
pedigrees intersecting, the intersection of terms of the same pedigree in 
complicated systems is highly unlikely. 
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a, 


a 


a b 

Figure 25 The behavior of two terms (a) of the same pedigree and (b) of different 
pedigree. 




4 Structure Principles for Molecular Energy Levels 

As in the case of atoms, we shall now seek to obtain an overview of the 
electron energy levels for molecules by considering the one-electron eigen¬ 
functions and “turning on” the mutual interaction of the electrons. For two 
atom molecules the invariance groups are the permutation (symmetric) 
group S f for the / electrons and the rotation and reflection groups d. If 
both nuclei have the same charge, then we need also include the reflection 
operator s given in (3.7). The only difference between this case and the case of 
the atom described in XIV, §3 is that the rotation group @ 9 is replaced by 
d (or by d and s). The following results obtained earlier remain applicable: 
Full shells do not affect the term manifold; n equivalent electrons are 
described by the same manifold as is described by a full shell missing n 
electrons; equivalent electrons can be separately treated and then joined 
together without taking the Pauli exclusion principle into account. 

For the atomic case we found it necessary to reduce the representation 
3) i x 3 y . For two atom molecules it will be necessary to do the same for 
sd x x . If / > 0 and A' > 0 then sd x x sd x , will be generated by four 
vectors (using (3.5)) v x u r , v x u_ x ., v_ x u r , v_ x v_ x .. v x u x . and v_ x u_ x , span a 
subspace of the representation sd x+r and v x u_ x ., v~ x u x . for the repre¬ 
sentation sd\ x ^ x \ (providing A d A'). If A = A' then v x U- x + v- x u x > belong to 
the representation sd 0 + and v x u_ x - v_ x u x belong to the representation 
sd 0 -. Thus, for A > 0, A' > 0 we obtain 


sd x X Sd x , = + ^-*'1 

\sd 2A+ sd o+ + sd0- 

for A # A', 
for A = A'. 

(4.1) 

In the same way, for A > 0, it is easy to show that 



sd x x sd 0 ± = sd x 


(4.2) 

and, trivially, 



,sd 0 + x sd Q + = sd 0 - x sd 0 - = sd 0 +; sd 0 

+ x sd 0 - = sd 0 -. 

(4.3) 


The formulas (4.1)—(4.3) can be used directly only in cases in which several 
inequivalent groups are joined together, each of which consists of equivalent 
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electrons. The case of a set of equivalent electrons must be treated separately. 

We may now develop a rule for eigenfunctions which is similar to that 
developed in XIV, §3, where we list the values + A, ± \ for each electron. For 
example, for two equivalent n electrons, we obtain the following table: 

A S 


( 

1, 

*) 

( 

i, 

i) 

2 

0 


( 

1, 

i) 

(- 

i, 

i) 

0 

1 

-1 

( 

1, 

i) 

(- 

i. 

-i) 

0 

0 


( 

1, • 

-i) 

(- 

i. 

i) 

0 

0 


( 

1, - 

-i) 

(- 

i, ■ 

-1) 

0 

-1 

-1 

(- 

-1, 

1) 

(- 

i, ■ 

-i) 

-2 

0 



In the last column there is, at two places, a (— 1). Here we have written (+1) 
or (—1) depending on whether the vector on the left is multiplied by the 
factor +1 or — 1 under the reflection s 2 . 

In cases in which the vector is not reproduced up to a factor upon multi¬ 
plication by s 2 , the last column was left blank. The factors +1 and — 1 are 
obtained, according to the above definition, from ( — l) p+v (from the Pauli 
exclusion principle—Slater Determinant) where v is the number of times 
0~ occurs and p = 0 or 1 depending on whether an odd or even permutation 
is required in order to return the symbol sequence (+A, ±f) altered by s 2 to 
its original order. On the basis of the important trace relations (see AV, §§8 
and 10.5) for this example it is easy to see that there will be three energy levels 
A = 2, S = 0;A = 0, S=1 and A = 0, S = 1. Are these 0 + or 0~ energy 
levels? In the space for the equivalent electrons under consideration we find 
that the trace of s 2 is, on the basis of the last column, equal to —2. The 
character of s 2 in sJ x (A ^ 0) is equal to 0, in s/ 0 - it is equal to 1, and in 
j/ 0 - is equal to - 1. Therefore, if x, y denote the characters of s 2 for the energy 
level A = 0, S = 1 and A = 0, S = 0, respectively, then we find that the 
following equation holds: 3x + y = —2. 

Since x = +1, y = ± 1, the only solution is x = — 1, y = 1, from which 
we find that the terms are 0~, S = 1 and 0 + , S = 0. For A = 0,1, 2,... we 
shall write X, H, A,...,; thus, in conventional notation we have obtained the 
following result: Two equivalent n electrons (written n 2 ) correspond to the 
energy levels 3 X~, 1 Z + , 1 A. In §6 we shall investigate the energy levels for the 
case of the hydrogen molecule. 

In order to determine whether a state results in the binding of the two atoms, 
that is, whether E(a) has an (absolute) minimum, it is necessary to examine 
the behavior of E(a ) as a function of a. 

5 Formation of a Molecule from Two Atoms 

While the structure principle described above together with the Hartree 
method leads to excellent quantitative results for intermediate distances a, 
for large values of a it is preferable to use perturbation theory beginning with 
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the “zeroth” approximation of separated atoms in order to determine the 
behavior of the energy levels. Using group theory we may precisely determine 
the possible terms for the molecule which arise from the energy levels of the 
two separated atoms by studying their behavior as the quantity a varies 
continuously from oo. Of course, we will obtain the same energy manifold as 
we have obtained from §4; here, however, we will determine which molecular 
energy levels continuously transform into which atomic energy levels of 
the separated atoms. 

If we begin with a pair of separated atoms, then we would be concerned 
with combining the electrons of one atom with those of another in order to 
obtain a new system. Since for a ~ oo we are concerned with two inequivalent 
electron groups, we may, according to XIV, §3 obtain the corresponding 
energy levels by adding the spins corresponding to the representations of the 
permutation group; here we do not have to take the Pauli exclusion principle 
into account. For example, if we have two atoms with quantum numbers 
L u Si and L 2 , S 2 , respectively (where L,, L 2 are the orbital angular 
momentum and S u S 2 are the spin angular momentum quantum numbers 
corresponding to the representations of the permutation group—note that 
we do not consider the effect of spin on the energy), then we obtain energy 
levels with S = S 2 + S 2 , S x + S 2 — 1,..., |Sx — S 2 |. To each of these S 
values we can associate each possible value of A. 

Under restriction to the subgroup d the representation may be de¬ 
composed as follows: 

x + • • • + (5-1) 

We may, therefore, obtain all possible A values providing we reduce 
.s/ Al x ^ a 2 by using (4.1)—(4.3) where A = L u L 2 — 1,..., 0; A 2 = L 2 , 
L 2 — 1,..., 0. For Ax = 0 and A 2 = 0 do we have the representations 
sd 0 + or s >£ 0 -? To answer this question we make use of the reflection operator 
s 2 . Let s be the reflection about the origin of the coordinate system. Then we 
obtain 

s D 2 ( n ) = D 2 ^s = s 2 , (5.2) 

where D 2(n) is the rotation operator about the 2-axis by the angle n. 

From (VII, §3) we find that the vector corresponding to M = 0 is multiplied 
by (— 1) L when it is transformed by D 2(n) . Therefore we obtain or sd 0 - 

depending on whether (— l) L w is equal to +1 or — 1 where w depends on 
whether the atom energy level L is even or odd (that is, w = — 1 for energy 
levels of the form 2S+ o = odd, see XIV, §5). 

If both nuclei are identical, then we need to introduce the character of the 
eigenfunction under reflection s about the midpoint as part of the charac¬ 
terization of the energy level as an index e and o. Here it is evident that for two 
different energy levels L u S x ; L 2 , S 2 of the separated atoms the resulting 
energy levels are doubled, once as even (e) and once as odd (o), since we only 
need to construct linear combinations as we have in (3.8): 

(1 + s)(p (l) (p (2) ; (1 - s)(p (1) (p (2) , (5.3) 
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where <p (1) is an eigenfunction for atom (1) for the energy level L u S { and <p i2) 
is such for atom (2). For the case of identical atomic energy levels 

{L\ = L 2 , S 1 = S 2 ) 

each term occurs only once because s(p w (p {2) is not linearly independent with 
respect to <p (1) <p (2) and the eigenfunctions obtained from the latter by per¬ 
mutation of the electrons. If we reduce ,s/ Al x .a/ A2 as above for the case 
Ai # A 2 we would find that the above procedure (even in the case of identical 
atomic energy levels) is equivalent to the energy level manifold for inequivalent 
electrons; thus, in making linear combinations (5.3) we would find that each 
energy level contains both even and odd terms. However, for A) = A 2 and 
A' 2 = A, we find that no new terms are introduced by x ■*Ai(0 
because they are already included in the linear combinations (5.3). For 
Aj # A 2 we therefore obtain all terms, both the even and odd terms, but, 
independent of the order of the A x , A 2 , each only once. 

For the case in which A t = A 2 the corresponding molecular energy levels 
A = 2Aj, 0 + or 0~ have only the following reflection characters e which we 
shall now derive: For each L, S term of an atom the only position dependent 
eigenfunctions (according to XIV (1.2a); here we shall use the symbol cp 
instead of t// <p^ v correspond to a definite spin value; here, for fixed M, the 
index v corresponds to representations A of S f . Thus, for the energy levels 
A = 2Aj the eigenfunctions are given by 

P(p\ W 1 . 2 > • • • . />a,v'(2)(/ +!,•••, 2/); 

the same holds also if we replace A x by — A v Here (1) refers to the eigen¬ 
function for the first nucleus (a notation which we have often used); 1,2,...,/, 
/ + 1,..., 2/ refer to the coordinates of the 2 f electrons. Here P refers to the 
permutation operators for the electrons. Here we find that the reflection s is 
identical to reflecting the eigenfunctions <p^, v about the nuclei and exchanging 
the electrons of the first atom with those of the second. Since we are con¬ 
sidering the same atomic energy levels, for the product <p (1) <p (2) the reflection 
s is identical to exchanging the electrons of one atom with the other because 
the reflection of the electron coordinates with respect to the nuclei results in 
the same factor +1 or — 1 for both <p (1) and <p (2) . It is therefore necessary to 
determine the factor which multiplies the eigenfunctions for the energy level 
A = 2A r , S under interchange of the/ electrons of one nucleus with those of 
the other nucleus. The representation A of S 2/ is characterized by S. According 
to XIV, §1 the corresponding representation A' in spin space differs only by 
a factor ( —l) p ; under the exchange of the / electrons of one nucleus with 
those of the other we obtain the factor (- 1) / . The factor in the representation 
A' is obtained from the formula for the addition of the angular momentum 
5] and S 2 = S x to obtain the total spin S is, according to XI, §10, equal to 
(— l) r (— l) s . Therefore we obtain e = (— l) s . 

For the case in which A = O 1 the eigenfunctions are 

P[<P i Ud(i* 2,..., f)(p L -\^ m (f + 1,..., 2/) 

± (p L -\ ik{1) (l, 2,..., /)<V (2) (/ + 1,..., 2/)]. 
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For the case A = 0~ in addition to the above considerations for the case in 
which A = 2Aj we get an additional factor ( — 1) in order that for A = 0~ 
the reflection character is s = (—l) s+1 and, only for the case A = 0 + is 
simply (— l) s . 

After we determine the molecular energy levels corresponding to the energy 
levels of the separated atoms, the question remains concerning the quantita¬ 
tive dependence as a function of the distance a between the nuclei. To begin, 
for large values of a, it can be obtained by means of perturbation com¬ 
putations, as we shall see in the following sections. We may only determine 
whether the two atoms are chemically bound from the behavior of E(a). 
Since E(a) = z l z 2 e 2 /a + E'(a), E'(a) must decrease with decreasing a. This 
condition may often be tested by considering the limiting case a = 0. For 
a = 0 we obtain an atom with nuclear charge Z x + Z 2 , the energy levels of 
which may be determined from XIV, §3. Here we may easily determine the 
molecular energy levels into which a particular energy level for an atom L, S 
(with reflection character w) (where Z = Z x + Z 2 ) will be split as a increases 
as follows: We have to take the subgroup d of the entire rotation group 
thus we find it necessary to reduce the representation Q) h according to (5.1). 
Since the permutation group is not affected by changes in a, S is conserved. 
For different atoms the L, S energy levels split into energy levels A = L, 
L- 1,..., 0; S. In order to determine whether we obtain a 0 + or an 0“ 
energy level we must consider the reflection operator s 2 ■ As we have already 
found in the discussion following (5.2), (—l) ! w describes the reflection 
character of A = 0 under s 2 . For identical nuclei we have to include the 
reflection s with respect to the midpoint of the two nuclei as a symmetry 
operation. For a = 0 this is identical to a reflection about the nucleus. For 
this reason the reflection character must be given by e = w. 

From the rule that energy levels of the same pedigree cannot intersect we 
find that, as we have found for the case of a single electron, it is possible to 
determine which energy level for a ~ 0 is transformed into a given energy 
level for a ~ oo as a changes continuously from 0 to oo. Atoms having closed 
shells have particularly low-energy (singlet) ground states. If both atoms 
have the same spin S as a -> oo then the singlet molecular energy level will 
have a particularly strong bound state since, for decreasing a, as a -> 0, the 
ground state must transform into a low-energy singlet ground state (see the 
example in §6). We shall call v = 2S the valence of the atomic state. We then 
can state the following rule: The saturation of the valences (for singlet states) 
leads to chemical binding. The extent to which this rule is applicable to more 
than two atoms will be discussed in §7. 


6 The Hydrogen Molecule 

We will now consider the hydrogen molecule in more detail as an example of 
the energy level scheme for the electrons of a molecule. In Figure 26 the 
behavior of the energy levels are qualitatively illustrated as the distance a is 
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Figure 26 The behavior of the energy levels of the H 2 molecule as the distance a is 
varied. 


varied. For clarity we have sketched the singlet and triplet terms separately. 
On the left-hand side (a = 0) we have the terms of the He atom and their 
splitting as the nuclei are separated. The S levels cannot be split; they have 
the symmetry character 1,3 E e + , as given by the rules of §5. The P levels must 
be split into the levels 1,3 £ e + , 1 , 3 n ff ; the D levels into 1,3 E e + , 1 , 3 n e , 1,3 A e . The 
energy levels of the H 2 molecule may, on the basis of the structure principles 
(§5), be considered to be derived from the one-electron terms. (Isa) 2 leads to 
a single 1 S e + level; ls<r 2pa leads to two levels 1,3 Xo- Since, according to 
Figure 22, the level 2p<r of an electron strongly decreases with increasing a, 
for H 2 it follows that the two levels arising from \sa 2pa and the ground state 
are the lowest energy states; the 3 £„ + level is unstable because it transforms 
into the ground state of the separated atoms. \sa 2pn leads to 1 , 3 n o levels, 
lsc 2sa to 1,3 Z e + levels, etc. On the right side (a ~ oo) we obtain the levels of 
the separated atoms H + H, where one of the atoms is in the ground state 
and the other is in one of the states Is; 2s, p; 3s, p, d, etc. With the help of §5 
we may easily show into which levels the terms Is + ns, p, ... are split as the 
atoms approach each other. The results are illustrated on the right-hand 
side of Figure 26. In conclusion we may then connect energy levels having 
the same pedigree. 

From these qualitative considerations it is already clear that the ground 
state 1 E e + leads to a strong chemical bond. We will examine this fact in 
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greater detail using perturbation theory. There are basically two methods: 
The first begins with separate atoms and considers the mutual interaction 
of the electrons and the interaction of the electron with the other nucleus as 
the perturbation. The other uses the Hartree method, that is, one electron 
functions in the field of both nuclei, and the interaction of the two electrons 
is taken into account only globally. 

In the first procedure we seek to approximate the eigenfunctions of the 
molecule by taking linear combinations of the two functions <p a (l)<p b (2) and 
(p b (\)<p a (2) where <p x and (p b are the normed eigenfunctions of the ground 
states of the hydrogen atoms a and b, respectively. Since the Hamiltonian 
operator is invariant under exchange of the electrons, 0* = (p a (l)(p b (2) ± 
(p b {\)(p a {2) must be the correct linear combinations for the perturbation 
calculation. The energy is therefore given by 


where 


<<£*, 

ll^l ! 2 ’ 


h = ~ (Pi + Pi) + - 


2m 


12 


'a 1 


e 

r a 2 


e 

rb\ 


e 

>'b2 


( 6 . 1 ) 


Here r 12 is the distance between the two electrons, r al is the distance of 
electron 1 from nucleus a, etc. Since, for example 


2m 



<Pa( 1 ) = E 0 (p a ( 1 ) 


it therefore follows from (6.1) that 


where 


E + 


2 E 0 + 


C ± A 

Y+s 2 ’ 


c = f|<p a (l)| 2 \(p b ( 2 )\ 2 (- -— - ^Wi d 3 r 2 , 

J \ r 12 r bl r a2/ 

A = [(p a (\)(p a (2)(p b (l)(p b {2) (- -— - —) d 3 r x d 3 r 2 , 

■J \ r 12 r bl r a2/ 

S = J%(l)%(l)d 3 ?i. 

E + is a singlet, E is a triplet. If A < 0 then the singlet energy level will lie 
below the triplet level. The expression e 2 /r 12 in A yields a positive contri¬ 
bution (for atoms this term is decisive; for example, for helium the triplet 
levels lie below the singlet level!). Here, however, the sign of A is determined 
by the two terms — e 2 /r bl — e 2 /r a2 . (For a precise determination of the 
integrals A, C, S see [15].) 

For the case of an atom (XII, §2) we were able to intuitively justify the 
energy differences between the singlet and triplet states by the fact that, for 
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symmetric electron position eigenfunctions, the electrons are closer than 
they would be for antisymmetric functions, where the electrons avoid each 
other. For the case of a molecule the effect of a symmetric eigenfunction 
does not only mean that the electrons would be closer, but that they would 
also be found more frequently in the vicinity between the nuclei than would 
be the case of an antisymmetric function. However, in the vicinity between 
the two nuclei the negative potential energy of the electron in the field of the 
two nuclei is very strong, and therefore the presence of the electrons between 
the nuclei will be more favorable energetically. 

The second procedure simply uses the one-electron function of the H 2 
molecule. The filling of the lsc state with two electrons leads to a binding, 
because, according to Figure 23, both electrons contribute to the binding. 
Here we speak of a binding electron pair. Here it is possible to carry out 
approximations for the ground state by using the Ritz variation principle 
(see XII, §3). 

In Figure 27 the observed energy level scheme for H 2 is displayed. 


7 The Chemical Bond 

In order to obtain a theoretical basis for chemistry it is necessary to be able 
to determine whether the lowest energy state of the electrons (including the 
Coulomb repulsion energy of the nuclei), that is, the energy E(r k ) defined 


Piifie. 7lltt£fccj<t giumZ Pfcy iioi 





7 The Chemical Bond 195 


in (2.2) is a minimum for a particular spatial configuration of the nuclei. 
Since an exact solution of the many electron problem for equation (2.2) is 
not possible it is necessary to make use of qualitative considerations and 
approximation methods. Here we shall not attempt to present a summary of 
the methods which can be used in particular cases in order to obtain usable 
results [16]. Instead, we shall only examine the general characteristics of 
the chemical bond. 

For more than two atoms the spatial symmetries are generally lost, and 
the only remaining symmetry is that of the permutation of the electrons. 
In §5 we have already found that the multiplet characteristics of a molecular 
energy level which arise continuously from the energy levels of the separate 
atoms are simply obtained by addition of the spins of the individual atoms. 
The energy levels which arise from the ground state of the separated atoms 
are, in general, not uniquely determined by the multiplet character. 
Nevertheless, the following rule is often valid: The addition of spin resulting 
in a total spin of zero leads to a state of stronger binding. This is the case 
because, as the atoms approach each other, they begin to resemble an atom 
with a closed shell, a situation which we have already discussed for the case 
of two atoms in §§5 and 6. It is clear that this can only be a rough rule. If, 
from each atom we draw the number of lines corresponding to its valence 
v = 2 S, then there is a parallelism between the reduction of the representation 
,@ Sl x 3>s 2 x • • • x 3> Sn for n atoms with spin quantum numbers S u 
S 2 ,...,S n and the combinatorics of the valence lines provided that we draw 
all “valence diagrams” in such a way that some pairs of the valence lines are 
bound together to single lines which run from one atom to another. The 
“valence diagrams” arising in this way correspond uniquely to the spin 
invariants in XIV, §7. If v = 2S is the valence of an atom, then we choose a 
basis for the representation space for <3 S as follows 


Ui,= 


„S + M,,S-M 
Li j. U — 


/(s + M)\(S - M)! 


where u + , u_ are the basis vectors for the representation S> 1/2 . We construct 
the spin invariants 

A = FI [« + (v)n-0t) - u_(v)r+(^)] 9v '* [“ + 0)x + u_(v)y] /v , (7.1) 

v<n v 

where g Vfl is equal to the number of valence lines between the vth and /rth 
atom (the bound valences) and /„ is equal to the number of free valence lines 
of the vth atom. (For the valence scheme given in Figure 28 we obtain 
9 is = 2, 0 23 = 1, 012 = 0, /n = 1, f 2 = f 3 = 0.) The coefficients of 

X s — 

A M — 


^S + MyS-M 

J(S + M)l(S - M)\ 





196 XV Molecular Spectra and the Chemical Bond 



• 2 

Figure 28 Valence scheme. 

in A transform then according to 3) s . We therefore obtain 2 S = £„/„ and 

/v + £ 0 V „ = n v (7.3) 

which is equal to the valence of the vth atom. 

If we then construct all possible spin invariants subject to the constraint 
(7.3) then we obtain basis vectors for all representations S> s which occur in 
the reduction of @> Sl x @s 2 x • • • x 2 Sn . For a given S there exist as many 
spin invariants as there are different valence schemes with free valences 
given by /„ = 2S. It is easy to give an example which illustrates the fact 
that it is possible to have more valence schemes for the same value of S than 
is permitted by the 3> s in the reduction of the 3> Sl x ^s 2 x • • • . This is based 
on the fact that not all the spin invariants corresponding to S are linearly 
independent. The number of complete linearly independent spin invariants 
for S must, of course, be equal to the multiplicity of 2> s . 

As an example, let us consider three atoms, each with valence v = 1. We 
then obtain S> 1/2 x ® 1/2 x ^ 1/2 = 2S 1/2 + ® 3/2 . For S = | there exists a 
single spin invariant. 

3 

A = [I [«+(v)x + u_(v)y]. (7.4) 

V=1 

For S = j there are three spin invariants 

A x = [« + (l)x + u_(l)y][u + (2)u_(3) - «_(2)u+(3)], 

A 2 = [«+( 2)x + u_(2)y][u + (l)n_(3) - u_(l)u+(2)], (7.5) 

A 3 = [u+(3)x + u _(3)y][u + (1 )m_(2) - m_(1)m+(2)]. 

Note that 

A 3 = A 2 — A t . (7.6) 

Of the three invariants in (7.5) only two are linearly independent. 

A much more difficult problem is that of finding the actual bound states 
and estimating their binding energies. The valence scheme of chemistry is 
only a first qualitative approximation of the real state of affairs. Many atoms 
exhibit multiple valences in their chemical behavior. It is not difficult to 
explain this situation. If the interaction energy of an atom with other atoms 
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in the molecule is not small compared to the difference between one of its 
excited states of a different pedigree and the groundstate, then we cannot 
exclude the possibility that the energy of a state arising from one of the 
excited states of the separate atoms, under continuous change of the separa¬ 
tion, will lie below the molecular energy level arising from the ground states 
of the atoms. One of the most important examples is the carbon atom, the 
binding possibilities of which form the basis of all of organic chemistry. 
The ground state is (see XIV, §4) a triplet state and is therefore a state of 
valence 2. There we have found that there is also a lower lying quintet state, 
that is, v = 4 arising from the configuration 2s 2 p 3 . How can such a strong 
bound state arise from this quintet state under continuous change of the 
separation of the atoms? To answer this question we need to present a more 
precise quantitative discussion. 

One method, which is based on the Heitler-London theory of the H 2 
molecule, seeks to estimate the binding energy by using perturbation theory, 
beginning with separate atoms. The calculations are completely analogous 
to those presented in XIV, §9, so that they need not be repeated here. We could 
begin by using the ground states of the free atoms as the zeroth approxima¬ 
tion. It is advantageous, however, to begin with the one-electron eigen¬ 
functions of the ground state electron configuration as the starting point of 
the perturbation calculation where the mutual interaction of the electrons 
in an atom are taken into account by means of average fields (see XIV, §9). 
Closed shells can, for the most part, be ignored because they do not play a 
significant role in binding because the closed shells of the individual atoms 
interact only slightly at an equilibrium distance. 

In contrast to XIV, §9 the initial one-electron eigenfunctions belong to 
different atoms, and therefore, they need not be mutually orthogonal. 
Furthermore, the rotation group is not a symmetry group of the molecule. 

For our first example let us choose / atoms, each with an electron outside 
its closed shell. Then, we may use the computations in XIV (9.2) to (9.21) 
directly. For x(Q ) we obtain the equation 

xa = ex, (7.7) 

where 

a = Y, °( P ) P = e Z C ik+ Z ( 7 - 8 ) 

P i<k i<k 


where in (7.8) we have neglected terms which are small compared to C ik and 
A ik and the nonorthogonality of the <p vt . By analogy with the H 2 atom the C ik 
and A ik are given by 


Cik — 


■A-ik — 


(pv,(i)(Pv k (k), ■{— ~ Z “-Z — WiWfc) )’ 

^ik j±k ajk j^i ?aji) 

(Pv t (k)<P Vk (i), -j— - Z --Z —}<Pv,(0<Pv k (fc) )> 

r ik j*k r ajk }%r aj i\ / 


(7.9) 


where r aji is the distance between the ith electron and the jth nucleus. 
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The solution can be carried out using the method of spin invariants 
described in XIV, §9. The computation is similar to the valence scheme, as 
we shall later show for a more complicated case. The lowest state corresponds 
to a certain linear combination x v A v of the spin invariants, but not 
necessarily to a specific valence scheme! (Consider, for example, three H 
atoms, and show that the formation of a H 3 molecule is impossible!) 

If the atoms have several electrons in the outer shells, then we must use 
equations XIV (9.22) to (9.30), except that the simplification in which a 
transition to a definite total orbital angular momentum is made is not possible 
because the molecule is not rotationally invariant. 

Let (p nkn denote the one-electron functions, where n = 1, 2,..., N and 
k n = 1,..., r„, where closed shells are not taken into account. For fixed n the 
(p nkn may correspond to the same energy value for an electron on the nth 
atom. Here we have used k instead of m since we can start with other linear 
combinations as we have done in XIV (9.22). The secular equation XIV 
(9.30) can be written in terms of spin invariants for the case of / electrons as 
follows: 


I 

n'lk'Wi •• • n'fk'n'f 


^ n ik n i ■ ■■ nfk n f', ft\k n 'i • • • nfk n 'j- 


i— 

I (k 'Kvn)A (vti)ntknl ... n fk n f> n\k nj nfk n y I &n'ikn\ ■■ ■ n'fk' n 'j 
v</i J 


E n\ kni" ' Hfkn 1 'flfkn f ' 


(7.10) 


In (7.10) we have neglected all terms for which P # e and P # (v/t) and the 
nonorthogonality of the cp nkn ; the E„ ikni ...„ fknf differ only slightly (relatively 
to the perturbation). Equation (7.10) is already too complicated in order to 
be able to solve for the various positions of the nuclei. It is often possible, 
however, to choose linear combinations of the (p nkn for which the C and the 
A lyfl) are diagonal (at least approximately) with respect to the n 1 k„ l ■ ■ ■ n f k nf ; 
n \K'i ' •' rifK'j in order that (7.10) can be decomposed into a number of 
subproblems of the form (7.7) and (7.8). Each of these subproblems is charac¬ 
terized by a well-defined distribution of the / electrons (independent of their 
labeling) in the states (p„ lkni ■ • ■ (p„ N k nN ■ Often we may guess which of these 
problems leads to the lowest binding energy (see, however, the remarks about 
polar bonds below). We will provide a more detailed explanation following 
the example of the benzene ring. 

For a subproblem we shall replace the indices n x k ni ,n f k„ f by 1,2 
According to the Pauli exclusion principle we may have at most two equal 
< Pi , thus we find that the (p i occur only either in identical pairs (p t = (pj or in 
simply occupied eigenfunctions. 

We may organize the (p t such that the which occur in pairs can be num¬ 
bered such that <p x = (p 2 ,cp 3 = <jP 4 ,.. -, (p 2v ~ i = <p 2 v and the (p 2v+ j,..., 
occur only once. Then a subproblem obtained from (7.10) can be expressed 
as follows: 

C - I Adi, k)L = (JE - £)0s (7.11) 

L d,k) J 

(U k) # (1, 2); (3, 4) •••; (2v — 1, 2v). 


7lltt£fccj<t giumf Pfcy iioi 
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Consider the exchange (1, 2). Since cp k = cp 2 we find that A lk = A 2k , A kl = 

A k2 . Since (1, 2) 2 = 1 and (1, 2)(2, k)(l, 2) = (i, k) for i, k # 1, 2 and 

(1, 2)(1, k)(l, 2) = (2, k), (1, 2)(2, k)( 1, 2) = (1, k ) we therefore obtain 

( 1 , Z)[c-Z *)1(1, 2 ) = |c - X fc)l, 

L (i.fc) J L (i.fc) 
that is, the permutation (1, 2) commutes with the operator 

C- I A ik (i, k) 

(i,k) 

Therefore the 0 S can be chosen such that they are eigenvectors for (1,2). 
The eigenvalue +1 can be excluded because of the Pauli exclusion principle 
(because = <p 2 !); therefore the 0 S must be antisymmetric in 1,2; 3,4; 5,6, 
etc. The quantity j[l — (1, 2)] and the product e = — (1,2)] 

i[l — (3, 4)] • • • f [1 — (2v — 1, 2v)] reproduce 0 S : e0s = 0s- Since e 
commutes with [C — A ik(U kj] we therefore obtain 


C - S MU k) 

(i,k) 


0S ~ 


= e 


C - I MU k) 

( i,k ) 

c - X MU k) 

(i,k) 


«0s 

<t>s = (E — E)4> s . 


For the spin invariants we shall use the following simplified notation: 
w +(1) m -(2) — h_(1)u + (2) = [1,2], 
u + (l)x + u_(l)y = [1, z]. 

Therefore 0 S must have the form 


(7.12) 


0s = [1, 2][3, 4] • • • [2v — 1, 2v] • • • [k, 0 • • • [m, z], (7.13) 

where k, /, m • • • > 2v. 

Thus it is easy to show that 

e(l, 2)0 S = -0 S , 

e(l, k)0 s = i0s, 

e(l, m)0 s = 

e(2, 3)0s = 20s- 


Therefore (7.11) can be written as follows: 



I 


(i,k) 

i,k> 2v 


MU k ) 


0s — (£ — £)0s. 


where 


C = C - ^ X i I ^ 

^ (a, b) ^ i< 2v 

k> 2v 


(7.14) 


(7.15) 
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where fc) is taken over all pairs a < 2v, b < 2v for which <p a # <H- Since C 
is independent of the different possible choices of <D S , its magnitude is 
unimportant for the determination of the lowest bound state, that is, the 
lowest eigenvalues of (7.14). Equation (7.14) has a form in which the pairwise 
(p t are no longer present, and therefore we omit the first factors [1,2] [3,4] ... 
in 4> s . Therefore we may also omit the constraint i, k > 2v under the summa¬ 
tion sign in (7.14). 

As we have suggested earlier, here we may also characterize the spin 
invariants by a valence scheme. To this purpose we place a point on a circle 
for each of the remaining functions <j o t . Here it is advantageous to line up the 
points for <p t of the same atom (see Figure 29; for the benzene ring see Figure 
32). Then we add a particular point 0. Then we draw a line connecting pairs 
of points such that, with the exception of 0, only one line connects each point; 
from O there shall be 2 S lines. For each such valence scheme there corresponds 
a spin invariant c/> s , because a line between the ith and kth point corresponds 
to a factor [i, /c] in 4>s and a line between the mth point and 0 corresponds to 
a factor [ m , z]. All the 4>s formed in this way are not, however, linearly 
independent. We obtain a complete system of linearly independent spin 
invariants <j> s if we use only those valence schemes in which no lines cross. 
Here, on the basis of 

[i, jfc] [/, m] = [i, /] [k, m] + [i, m] [/, fe] (7.16) 

we observe that valence schemes which cross can be replaced, step 
by step, by noncrossing valence schemes. Thus spin invariants with crossing 
can be linearly combined by noncrossing valence schemes. On the other hand, 
it is easy to see that the number of noncrossing valence schemes is equal to 
the dimension of the representation of the permutation group corresponding 
to S. We can also use (7.16) to obtain the expressions (i, k)4> s in an elementary 
way. 

It is possible that the problem (7.14) can be reduced further—if the sub¬ 
space of all the 4>s = [a, b] •.. (which contains the same factor [a, h]) is 
transformed into itself by C - £ A ik (i, k) and the lowest eigenvalue for an 
4) S lies in this subspace. We shall then refer to [a, fe] as a localized valence. 
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This is therefore certainly the case if all A ak — 0, A bk = 0 (k # a, b). Then 
(7.14) is transformed into 



z 

<U) 

i,kiEa,b 


A ik (ik) 


<t>s = (E ~ E)(f> s , 


(7.17) 


where 


C — C + A ab . 

In (7.17) we can neglect the factor [a, h] in </> s . In this way we may eliminate 
all localized valences step by step. We then obtain a problem of the same 
form (7.14) with C instead of C. 

We shall now replace the operator (ik) by 4(1 + 4 S (i) ■ S (k> ) as we have in 
XIV (9.21). Then we obtain the equivalent problem 


C-\ I A ik -2 £ A ik S" ■ S (k) 


' d,k) 


(i,k) 


4>s = (E- E)(p s . (7.18) 


If the ith and kth spin in </; s is coupled by a valence as follows 
[i, k ] = u + (i)u^(k) — u_(i)« + (fc) 

then we find that S {i) • S ik) cj) S = - f </> s . If, on the contrary, i is not connected 
by the valence scheme to k then the expectation value 

M(s (i) • S<*>) = <0 S , s (i) • s (k) <t> s y = o. 


For cj) S as the solution of (7.18) with smallest eigenvalue ((j> s need not belong 
to a particular valence scheme, but may be a linear combination!) we may 
consider the expectation value of D. ik = —jS (i) ■ S (k) to be a measure for the 
occurrence of the valence i — k because, under the localization of valence it 
takes on a value of 1, and for noncoupled valence takes on the value 0. 

We will now consider the example of the benzene ring, the usual valence 
scheme of which is illustrated in Figure 30. We shall shortly find out whether 


H 

i 

H v /\ 

C C 

I II 


H 

Figure 30 The usual valence scheme of the benzene ring. 
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this valence scheme reflects reality. The valence of carbon in this scheme is 
four. As we have already mentioned above, the electron configuration 
Is 2 2s 2 p 3 plays a decisive role for the binding, although the ground state of 
the C atom arises from Is 2 2s 2 2p 2 . We shall replace the four eigenfunctions 
which are available for the s and p electrons T^AiO') with m = 1,0, — 1 
by other linear combinations. We now select one of the C atoms and choose a 
coordinate system for which the third axis is perpendicular to the plane of 
the benzene ring. Let e u e 2 , e 3 be the three components of a unit vector, let 
Zo = <Ao (r), Zi = * 2 = e 2 ^ 1 (r)a 2 , * 3 = e 3 iA 1 (r)a 3 where the 

a; are normalization constants; we shall choose the following linear com¬ 
binations : 


<Po — 71 Zo + 

(Pi = n/I Zo + n/I Zi 
(Pi = v 7 ? Zo - Vt Zi 
<? 3 = Zs- 

The (p 0 ,(p 1 ,(p 2 have significant charge densities in three directions in the (12) 
plane, separated by angles of 120°. The <p 0 , (p u cp 2 are invariant with respect 
to rotation about these axes. If we choose the initial functions for each C atom 
in this way, then for each C atom we obtain appreciable exchange integrals 
for the cpi (i = 0,1,2) only for neighboring C or H atoms which lie in the three 
directions, and we find that the chemical bonds illustrated in Figure 31 are 
easily localized. Thus we can limit our consideration to the remaining six 
electrons which correspond to the eigenfunction q> 3 . If we number the C 
atoms cyclically, then only the exchange integrals ^4 12 = A 23 = A 34 = 
A 45 = A 56 = A 61 = A are significant, reducing the problem to the solution 
of the following equation 

{C - Am, 2) + (2, 3) + (3, 4) + (4, 5) + (5, 6) + (6, 1)]}0 S = (E - E )<fe. 

(7.20) 


/ §Z 2! 


- v / iz2> 




(7.19) 



Figure 31 The localized bonds of the benzene ring. 
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The lowest energy level occurs for the case in which S = 0. The five possible 
linearly independent spin invariants are given by the following diagrams: 




Figure 32 



5 


Under the cyclic perturbation P = (1, 2, 3, 4, 5, 6) we obtain 

— 0 ( 2 ); ?<!>( 2 ) = 0 ( 1 ); ^ 0 ( 3 ) = 0 ( 4 ); ^ 0 ( 4 ) = 0 ( 5 ) 

and P0 (5) = 0 ( 3)3 

[(1, 2) + (2, 3) + (3, 4) + (4, 5) + (5, 6) + (6, 1)] = £ P"(l, 2)P~". 

n = 0 

Using (7.16) we may readily calculate (1,2 )</> (k) and therefore also 
P n ( 1, 2)P~ n (p (k) . The smallest eigenvalue of (7.20) is given by 

E - E = C + A(- 1 + 713) = C + 2.6 A. (7.21) 

For the spin invariant (a linear combination of the cf> (k) ) belonging to this 
eigenvalue the expectation values M((l, 2)) = M((2, 3)) are given by 
i(l - Vl3), that is, 

M( Q 12 ) = M(Q 23 ) = • • • = —fM(S (1) • S <2) ) 

= iM(-2(l, 2) + 1) = 4(2 + 713) 

= 0.62. 

Without taking the valence coupling of the atoms, that is, for 

M(Q 12 ) = • • • = 0 

we obtain the energy from (7.18) as follows: 

E- E=C - 3A. 
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For localized valences, that is, for example, M(Q 12 ) = 1, M(Q 23 ) = 0, 
M(Q 34 ) = 1, M(Q 45 ) = 0, M(Q 56 ) = 1, M(Q 61 ) = 0, the energy is equal to 

£-£ = C + |A = C+1.5A. 

The actual binding energy is therefore approximately A(6 ■ 0.62 — 3) = 0.7 A 
greater than that for local binding. In this case we speak of a resonance of the 
valence diagrams, and 0.7 A is called the resonance energy. Using the com¬ 
putation methods described above we find that additional computation is 
required in order to show that the case of the six-sided benzene ring is sub¬ 
stantially preferred than similar cases of five- or seven-sided rings. We shall 
use another method to show that this is the case. See Figure 33 for a com¬ 
parison of the above results with experiments. The digram illustrates the 
distance between two neighboring C atoms for simple a binding and in¬ 
creasing n binding (for the concept of a and n binding, see below (page 205)). 
Therefore, for simple a binding the distance is 1.54 A; for a + localized n 
binding the distance is 1.33 A. The measured distance in benzene is 1.39 A, 
which, according to Figure 33 corresponds to approximately 

M(Q 12 ) = • • • = 0.66, 

from which the fact of resonance of valence diagram is proven. 

Whether it is possible to obtain a satisfactory approximation from 
beginning with neutral atoms depends upon whether the eigenfunctions 
for a bound molecule actually have, to some degree, the form of those of the 
separate atoms. This is certainly not the case for the so-called polar bond such 
as HC1. As the atoms approach from a ~ 00 the eigenfunctions change so 
drastically that, in the case of a bound molecule, the electron from the 
hydrogen atom is, for all practical purposes, completely attached to the 
chlorine atom, Here it is also correct that the bound singlet state of the 
molecule is obtained continuously from the state of the separate atoms 



Figure 33 The distance between two neighboring C atoms for simple a binding and 
increasing n binding. 
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(Figure 34); however, a better approximation is obtained from perturbation 
theory by beginning with the eigenfunctions of the separate ions Cl ~ and H + , 
which separately have a highpr energy than that of the neutral atoms (Figure 
34). The perturbation calculation beginning with neutral atoms yields a 
curve (1) as a function of a (with the transition indicated by a dashed curve), 
while that for the ions yields the curve (2). By simultaneous consideration 
of both initial states we obtain the solid curves, because terms of the same 
pedigree cannot cross over. 

A second method which may be used to attack the problem of the chemical 
bond is equivalent to the structure principle described in §4 for two molecules. 
The mutual interaction of the electrons is taken into account only in the sense 
of the Hartree procedure. Here we begin with the one-electron functions, the 
corresponding energies of which are qualitatively given in Figures 23 and 24. 
These one-electron functions are approximated in terms of linear combina¬ 
tions of the eigenfunctions for the separated atoms. If we can construct such 
linear combinations such that we obtain an essential increase in the electron 
charge density between the atoms, then we obtain a decrease in the associated 
energy (as we have seen in the case of the H 2 molecule in §6), that is, we obtain 
a contribution to the binding energy of the molecule. According to the Pauli 
exclusion principle, such a state can, on account of the spin, be doubly 
occupied. If, in a multi-atom molecule, the linear combination which corres¬ 
ponds, to the lowest energy state arises from two functions from each of the 
atoms, then we may represent these two electrons by a valence line. Here we 
speak of a localized valence and of a bounding electron pair. In the H 2 
molecule we have seen such a bounding electron pair. If the eigenfunctions 
are rotationally symmetric about the line joining the two atoms, we then speak 
of a a bond. If the eigenfunctions correspond to the representation A = 1 of 
the rotation-reflection group d then we speak of a n bond. 

For an example we again consider the case of the benzene ring. We need 
only consider the electrons which lie outside the (n = 1) shell, because the 
electrons in the (n = 1) shell are less perturbed as the atoms approach. For 
the four electrons with n — 2 we again use the atom functions given in (7.19). 
The functions cp 0 , (p u q> 2 between neighboring atoms (C—C and C—H) 
result in localized a valences. It now remains to specify one electron and one 
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atomic function (p for each C atom, which, for the kth atom, we shall denote by 
4> k . Let us consider a ring of N C atoms. Let V k be the potential of the kth 
atom, which, for large distances from the center of the atom must behave 

like- e 2 lr k because the shielding is such that only a single charge remains. 

The i j/ k will satisfy the equation 

H k i// k = E 0 i// k with H k = K + V k , (7.22) 

where K is the kinetic energy (1/2 m)p 2 . We must determine the eigen¬ 
functions of the equation 

N 

H(j> = E(j>, where H = K + £ V k . (7.23) 

k= 1 

For an approximation to cj> we set 

<t> = £ k ( 7 - 24 ) 

k= 1 

and we seek extreme values of <7/>, //(/>/<</), </>>. We obtain the system of 
equations 

£ H ik a k = Ea h (7.25) 

k= 1 

where <t^, iK> ~ S ik and 

H lk = + <^, X W- 

Neglecting the nonorthogonality of the i// h for i / k we obtain 

H ik = (ti, (7.26) 

H ik is, for all practical purposes, nonzero only for neighboring atoms i, 
i + 1. As a result we need only consider the terms in the sum £,■** 1/ for 
which j = i, and we obtain 

= < 1 h, Viil/ i+ 1 >. 

Since all the H ik are the same, we set H iti+1 = —A (A > 0). For i = k, in the 
same approximation we obtain 


Ha = <<Ao = £ 0 + \*i, £ ^ ~ £o- 

We define the operator T as follows: 

Til* k = <A fe +1 and = (7.27) 

We may therefore write the system of equations (7.25) in the form 

[£ 0 - A(T + T~ ‘)]0 = Ecj>. (7.28) 


Piifie. 7fttt£fccj<t giumf Pfcy iioi 



7 The Chemical Bond 207 


From T N = 1 it follows that the eigenvalues of T are the Nth roots of 
unity e i2mlN)v . The eigenvalues of (7.28) are therefore given by 

2n 

E v — E 0 - 2 A cos — v with v = 0, 1, 2,..., N - 1. 


In order to obtain the eigenvectors <j> v we consider the condition 
T<t> v = £ 1 = E 4 V) «K 

fc=i fc =i 

from which it follows that 


fll v) = e -(2ni/N)kv 

The N electrons are to be assigned to the eigenfunctions </> v in such a way that 
pairs in the same state are found in the lowest possible energy levels E v . 
The energy level E 0 — 2 A is doubly occupied, E 0 — 2 A cos 2n/N has 
occupancy 4, the others also have occupancy 4. Rings with N = 6, 10, 14 
therefore have closed shells and are energetically more favorable. In the case 
of localization of valences each of the electrons have an energy of E 0 — A, 
that is, the total energy would be E = N(E 0 — A). Let E = N(E 0 — kA ). 
In Figure 35 we exhibit the dependence of k as a function of N. For the 
benzene ring we obtain k = f. A single localized bond corresponds to a 
contribution of —2 A to the energy. Therefore k/ 2 = 2/3 = 0.67 is the 
strength of the n bond in the benzene ring, compared with the value 0.5 
neglecting the resonance of the valence diagrams and 0.62 according to the 
Heitler-London procedure (Figure 33). 



Figure 35 The binding energy in a ring of N C atoms. 
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8 Spectra of Diatomic Molecules 

We will only discuss the actual energy levels of the entire molecule for the case 
of diatomic molecules. The derivations of §2, in particular, the expression 

(2.5) cannot be used without modification, because the energy levels for 
A # 0 are doubly degenerate. For A = 0, using the expression (2.5), and the 
Ritz variation method, we obtain (2.6) and (2.7). For diatomic molecules 

(2.6) takes on the special form 

- ~ A «A + E(r) = efc (8.1) 

where 1/M = l/m 1 + l/m 2 and r replaces r' k . 

According to the contribution of XI, §3, the solutions must take on the form 

Hr) = Y‘ m ((0, (p)g(r), ( 8 . 2 ) 

where g satisfies the equation 

-2i?(^ (r!, )-^ !, ) + E(r)9 - 88 ' <83) 

We will postpone the solution of this equation until later, where we shall also 
consider the case A ^ 0. 

The exact eigenvectors of the total Hamiltonian operator must correspond 
to a definite total angular momentum quantum number K: 

Since the ^ are basis vectors corresponding to the representation 0) K of the 
rotation group, the entire set of values for ^ are already determined pro¬ 
viding they are known for a particular position of the vector joining the 
nuclei. Let D be a rotation which rotates the vector r (in polar coordinates 
6, (p, r ) in the direction of the 3-axis, that is, in Q — 0, (p = ■ • •, r). Then we 
obtain 

r K ) = D({)ff(Dr; Dr K ) 

= X ^M'i^r; Dr K )D M : M = ^ •••)?; Dr K )D M : M , (8.4) 

M' W 

where D M M is the representation matrix in the 2) K representation. The rota¬ 
tion D is a function of 9 and qr, it is, however, not uniquely determined, 
because if D 3x is a rotation about the 3-axis by an angle a, then D 3a D also 
satisfies the condition imposed on D. Thus we obtain 

<t>M(r; r K ) = X 0M'( 0, D 3x Dr K )e~ iMx D M , M 

M' 

= Z D 3 UU0, . • •, r; Dr K )e~ iM *D m m 

M' 

— Z •••>*■> Dr K )D M , M , (8.5) 

M' 


TfLa^JtASjp UM^i c jaJ. 
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where the latter is obtained from D^(Pm' = e iMx <pM', with the result that 
the ambiguity of the reaction D plays no role. 

Under the assumption that the eigenfunctions can be written, at least 
approximately, as a product (2.5) or as a linear combination of such products, 
we may therefore write, as an approximation, 

K r(0,. -., r; r') = g M (r)(p r (r' K \ 

where Dr K = r' K , where the <p,(r() belongs only to a single value of the electron 
energy E(r). Since <p^', upon multiplication by D 3a , is multiplied by the factor 
e~- iM'a, an( j ; s j nce) f or a eigenvalue E(r) of the electron energy it must corres¬ 
pond to a definite quantum number A we find that <^.(0, • ■ •, r, r' K ) must 
(approximately) vanish for M' # + A. For A = 0 we therefore obtain 

<£o( 0, ...,r,r' K ) = g(r)<p(r' K ), 

</&, = 0 for M' # 0. 

Thus we obtain 

<PUr, r K ) = g(r)(p r (Dr K )D 0M . (8.6) 

This is, however, identical to (8.2) and (2.5), because D 0M is, up to an un¬ 
important factor, identical to the spherical harmonics Y m(M, cp). The D om are 
functions of r: D 0M (r). Since D must be such that Dr lies along the 3-axis, the 
rotation DD (1) corresponds to the vector D^r, that is, rotates it to the 3-axis. 
It follows that 

D 0M (D (1 ^ = (DD( 1)) 0 M = X! 

M' 

As functions of r the D 0M belong to the representation 3) K . Therefore they 
must be equal to (up to a factor) Y Therefore, in the case A = 0 we may 
identify K (according to (8.2)) with l. 

The Hamiltonian operator is also invariant under reflection s of all 
coordinates (nuclei and electrons). Thus (8.6) must belong to a definite 
reflection character. In order to determine the latter, we observe that 
^ 2 (rt)S = s i where D 2(7[) is the rotation about the 2-axis of angle n. Thus it 
follows that 

r K ) = </>m(.st; srj = g(r)(p(D's? K )D' 0M , 

where D' is the rotation which rotates sr in the direction of the 3-axis; there¬ 
fore D' = D 2{n) D. Therefore, since D 2(n) s = s 2 , we obtain 

SfpUr; r K ) = g(r)cp(s 2 Dr K ) £ D 2(n)0M .D M , M . 

M' 

According to (VII, §3) the D 2(n)0M have the form (- 1 ) k <>om; thus, from 
s 2 (p(Dr K ) = £(p(Dr K ) it follows that 

s<Pm(y r K ) = e(— \) K g(r)(p(Dr K )D 0M = e(-1 ) K <p&. 

The reflection character with respect to s for (8.6) is therefore ( — 1) K for 0 + 
and ( — 1) K+1 for CT terms. 
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For A # 0, instead of (8.6), it follows that the approximation for the 
eigenvectors is given by 

4> m (7'» K) = g +(r)(pA(Dr K )D AM + gJ>')<P- A (Di : K )D^ AM . (8.7) 

Since the eigenvectors must belong to a specific reflection character with 
respect to s, we may determine the ratio of g + to : As above, we obtain 

r K ) = 9+(r)(p A (D'sr K )D' AM + g_(r)(p- A (D'sr K )D'_ AM 

= 9 + ( i ') ( Pa( s 2^k) X d) 2(n)AM D MM 

AT 

+ g-(r)(p^ A (s 2 Dr K )Y J D 2(n) ^ AM :D M ' M 

M' 

= 9+(r)<p- A {Dr K ){ - 1) K ~ A D _ AM + g~(r)<p A (Dr K )(—l) K+A D AM '. 


Since s <j)^ = we must therefore have 

Mr) = (-l)* + V? + (r). 

Both values w = ± 1 are therefore also possible for A ^ 0 for the approxi¬ 
mate eigenvectors (here we set g+ = g): 

4>Ur, r K ) = g(r)(p A (Dr K )D AM + (- 1) K+ \vg(r)<p ... A (Dr K )D _ AM . (8.8) 

Using the eigenvectors (8.6) or (8.8) we may apply the Ritz variation principle 
in order to determine g(r). 

For g(r ) we obtain the following differential equation 


-2 Ji7^-Wrl^ W ^ +U ^ = ^ (8 ' 9) 

where 

U(r) = E(r ) + <M 2 ) ~ J <P A (r K ) ^~ 2 9A(r K )(d 3 r K ) f (8.10) 

and 

W(r) = J (p A {r K ) <p A (r K )(d 3 r K ) f , (8.11) 

where M is the orbital angular momentum of the nuclei and 

<M 2 > = K<Pa(D7 k )D am + (-1 ) K+A w<p A (D? K )D_ AM , 

M\<p A {D7 K )D AM + ( — l) x + A w(/9_ A (£)r K )D_ AM ]>. (8.12) 


The third summand in (8.10) represents only a slight change in E(r) and will 
therefore be ignored in the following. The same is also true for W(r). It only 
remains to compute the term (8.12). With L as the orbital angular momentum 
of the electrons and K as the total orbital angular momentum, we obtain 
K = M + L. The L ; component commutes with the ith component of K, 
from which we find that 

M 2 = (K — L) 2 = K 2 + L 2 — 2K ■ L = K 2 + L 2 — 2L- K. 


7fttt£fccj<t giumf Pfcy iioi 



8 Spectra of Diatomic Molecules 211 


Thus for K 2 (j)u = K(K + we obtain 

<M 2 > = K(K + 1) + <L 2 > - 2<L • K>. (8.13) 

In the last term we shall, according to VII (3.17) and (3.18), apply the following 
substitutions: 

KM = y(K + M)(K -M+ 1 ) 

+ M + 

K 2 <£m = -|y(K + M)(K-M + 1) ^_ 15 
+ iV(K-M)(K + M+l)^ +1 , 

Thus, we find that all the operators in (8.13) which act on the nuclei co¬ 
ordinates have disappeared. Since (8.13) is invariant under rotations, we 
may choose the position of the line joining the nuclei along the 3-axis (here 
the rotation D in (8.8) is equal to 1) such that, according to (8.8) we obtain 
d am = <5am and £>_ AM = <5-am> and, accordingly, only the <f>f, for which 
M = + A are nonzero. Then, since i = <t >a +1 = 0, we obtain 

<L-K) = A | ^K)L, cp A (r K )(d 3 r K y = A 2 , 

J (8.14) 

<L 2 > = A 2 + j(p A (f K )(Lj + L 2 2 )<p A (r K )(d%y. 

The last term in (8.14) cannot, in general, be computed. If we neglect it, as we 
have above for the last term in (8.10) and for W(r), then for (8.9) we finally 
obtain 

- + 2aJ? [K(K + » - A ' ]9 + - v- (8.15) 

Here [ K(K + 1) — A 2 ] has a very intuitive meaning as the value of M 2 . 
If the electron angular momentum lies (for all practical purposes) along the 
line joining the nuclei [this is equivalent to neglecting the last term in (8.14)], 
it then follows that M is perpendicular to the line joining the nuclei, that is, 
K 2 = M 2 + L 2 or M 2 = K 2 — L 2 . With K 2 -> K(K + 1) and L 2 -*■ A 2 we 
obtain the above result. 

For A # 0 the equation (8.15) is independent of the reflection character w. 
The approximate value e obtained in the above manner is therefore the same 
for both solutions (8.8). In higher approximations there must be a small 
splitting between the energy levels w = +1 and — 1, which has been observed 
for large angular momenta. 

Equation (8.15) is the eigenvalue equation for an anharmonic oscillator 
which oscillates about the position p of the minimum of the function 
U(r ) = (1/2 M){K{K + 1) — A 2 )/r 2 + E(r). From the large value of the 
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mass M, for moderate values of M 2 we may approximate U(r) (where r — p 0 
is the position of the minimum of E(r )) as follows: 


U(r) — 


1 (K(K + 1) - A 2 ) 
2 M ? 


+ 


(r - Po) 2 


E"(p 0 ) + E(p 0 ). 


Under the above assumptions the position p of the minimum of U(r) lies 
near p 0 ; making a series expansion of U{r) about p 0 we obtain 

U(r) * E(po) + K(K + P ~ A ' + A[_K{K + 1) - A 2 ] 2 
2 M 

+ Ur~ Po) 2 E"(Po) 

and we therefore obtain the eigenvalues 

1 K(K + 1) - A 2 


s Kn — E(Po) + 


2 M 


Po 


+ AIK(K + 1) - A 2 ] 2 


+ 


E"(P 0 ) 


M 


(» + !)> 


(8.16) 


where 


M 2 E"(p 0 )pf 

Here s Kn is obtained by adding the electron energy E(p 0 ), the oscillator energy 
co(n + 1/2) where co — JE"I'M and the rotation energy 

/ 1 \((K( K + 1) - A 2 \ 

\2m)\ pi )■ 

The energy level scheme has the structure illustrated in Figure 36. The 
transitions between the energy levels produce the spectrum. If the electron 
energy and the oscillator energy are held fixed, then by changing the rotation 
energy alone we obtain the rotation energy lines in long-wave infrared, the 
so-called rotation spectrum. By changing the oscillation and rotation 
energies we obtain an infrared spectrum which has fine structure due to 
rotation. If we change the electron energy we obtain the band-structure in 
the visible part of the spectrum. 

The reader may wish to show that the following selection rules are 
applicable for dipole radiation: 

K -> K ± 1, K but not 0 -* 0, 

w —► —w, 

A -> A ± 1, but not 0 + -> 0~ and 0~ -> 0 + , 

where, for 0 + -> 0 + and 0~ -*■ 0~ and K -* K the transition (since 
w -> — w) is forbidden. 
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R 

Figure 36 Energy level scheme of a diatomic molecule. 


The reader will be able to examine the behavior of the eigenfunctions for 
the case of molecules having two identical nuclei under exchange of the 
nuclei, also after we have taken into account the electron spin in the next 
section. How will the eigenfunctions behave in this case of identical nuclei 
when both nuclei have the same spin (j) and the nuclei must therefore 
satisfy the Pauli exclusion principle (para and ortho hydrogen)? 


9 The Effect of Electron Spin on Molecular Energy Levels 

The term H s in (1.1) does not alter the symmetry group. We may therefore 
obtain the possible molecular energy levels by “switching on” H s in the 
form XH 5 (where A changes from 0 to 1). Each molecular energy level belongs 
to a particular representation with respect to the permutation of the electrons 
which is uniquely related to the spin quantum number S. We therefore need 
only add the total orbital angular momentum (quantum number K ) and the 
spin angular momentum (quantum number S) for the corresponding energy 
level in order to obtain the total angular momentum J. This addition there¬ 
fore leads to a splitting of energy levels with the increasing influence of XH S 
into components with total angular momentum quantum numbers J — K + 
S,K + S - - S\. 
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If, under complete inclusion of H s (that is, for X = 1 ) the splitting of a 
molecular energy level remains small compared to the separation of two 
neighboring rotation energy levels, then we have obtained a qualitative 
description of the position of the energy levels and we may then apply the 
selection rules. 

If, on the other hand, the spin splitting is larger than the separation of the 
rotation energy levels, we do not obtain a qualitative overview of the energy 
levels in this way. Here it would be preferable to take the spin into account 
in computing the electron energy levels. The electron energy levels under 
consideration with the quantum number A belong to a representation of the 
permutation group of the electrons which correspond to a spin quantum 
number S. Let U s , L/ s _!,..., l/_ s be the 2S + 1 spin functions (the eigen¬ 
functions for the spin component about the line joining the nuclei), then, 
according to the scheme (5.1) we need only reduce the representations 
a x x j/ s _!,..., x ^ 1/2 or ^ a x .^ 0 - I n order to deter¬ 

mine whether the function U 0 belongs to the representation .s/ 0 + or s4 0 - we 
need to compute s 2 U 0 . Since s 2 - D 2(n) s and sU 0 = U 0 (from VII, §7) from 
the remark at the end of (5.2) it follows that s 2 U 0 = (— 1) S U 0 , from which it 
is possible to decide whether we have the case s/ 0 + or s/ 0 ~. 

Let denote the representation which arises from this reduction, then 
all considerations from §8 are easily applicable if we replace A by Q. The 
electron energy levels are characterized by Q instead of by A. 

In closing this chapter, we note that more comprehensive methods have 
been developed which make the structure of the chemical bond more under¬ 
standable (see [16]) and which makes it possible to investigate the structure 
of molecules by investigating their spectra [17]. 
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CHAPTER XVI 


Scattering Theory 


In XI we have given several examples of operators W which correspond to 
ensembles, and F which correspond to effects that occur in the investigation 
of the spectrum of atoms and molecules. Since we do not yet have a more 
comprehensive theory which permits us to obtain the maps 3' K and 
3F L from information about the construction of the preparation and 
registration apparatuses, in XI we have “guessed” the form of the operators 
We K and F e L on the basis of partly theoretical and partly heuristic 
considerations, in order to formally introduce them as axioms in MS'. This 
procedure shall now be extended to the more comprehensive domain of 
experimental results: We shall now consider the large domain of scattering 
experiments. 

In §1 we shall formulate the general properties of the ensembles W used 
in these experiments. In §2 we will do the same for the effects F. Later, in §6 
we shall exhibit special operators W and F in order to define the so-called 
effective cross section. 


1 General Properties of Ensembles Used in 
Scattering Experiments 

In a scattering experiment different microsystems are shot at each other. 
Here we need to recall the considerations in VII concerning composite 
systems and interactions. We shall not, however, resort to the route taken 
there, but, proceed in a different fashion which assumes the description of 
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composite systems set forth in VIII and draw conclusions for the case of 
scattering experiments. In order not to complicate matters we shall not begin 
with the most general case, but restrict our initial consideration to composite 
systems consisting of two elementary systems. According to VIII (2.12) the 
Hamiltonian operator of the system can be written in the form 

H = H 0 + Hj, (1.1) 

where 


H 0 = 

2m x 2 m 2 


( 1 . 2 ) 


is the kinetic energies of the two elementary systems and Hj is the 
“interaction”. We shall at first make no assumptions concerning the form of 
Hj ; later we shall consider the special case in which Hj describes the potential 
energy. 

Furthermore we shall assume that the two systems (1) and (2) are different 
in order that we may use the structure described in VIII, §2. In §4.8 we shall 
briefly consider scattering processes for “identical” systems. 

In the following we shall prove theorems which are so general that, in 
some cases, they can be applied to cases in which the assumptions made above 
are not satisfied. For an understanding of these theorems it is desirable to 
assume that the assumptions made above are satisfied. 

According to VIII and IX a composite system consisting of two elementary 
systems (1) and (2) may be most simply described by means of either the 
Schrodinger picture or the interaction picture, since in both the Hilbert space 
can be described as a product space = jf j x 2 independent of the 
time. Here the effects associated with systems (1) and (2) have the form 
Fj x 1 and 1 x F 2 and are time independent in the Schrodinger picture. 
In the Schrodinger picture the operators for the ensembles are time dependent 
and have the form (X (2.2)) 


W t = U t + W 0 U t , (1.3a) 

where 

U, = e im . (1.3b) 

In the interaction picture (X (3.4), (3.5)) we find that 

W, = V?W 0 V t , (1.4a) 

where 


V t = U,e~ iHot = e iHt e~ iHot . 


(1.4b) 


The Schrodinger picture is particularly suited for the determination of the 
form of the operator W for an ensemble from the experimental configuration 
of the preparation apparatus. This “intuitiveness” arises from the fact that 
the mode of operation of the position operators of both particles is time 
independent and is easy to specify in a position representation, and from the 
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Figure 37 


fact that, experimentally, we know that Hj ~ 0 when the distance r 12 of the 
two particles is large compared to the dimensions of an atom. 

A scattering experiment is carried out using a preparation procedure a 
which consists of two parts (see Figure 37) in which the first part produces 
system (1) and the other produces system (2) and in which both systems are 
“shot” at each other. We shall now consider the problem of finding a mathe¬ 
matical structure to describe the physical situation described above and 
illustrated in Figure 37. 

We must therefore introduce a new structure into our mathematical 
theory. To this purpose we shall introduce a map y of a subset n <= 2! x J' 
into 2! together with the following physical meaning (that is, with the 
following mapping principle): Let a x and a 2 be two preparation procedures 
for which the apparatuses can be jointly arranged (that is, (a u a 2 ) e n) in 
such a way as to construct a new preparation procedure a — y{a u a 2 ). 

We wish to warn the reader (without thinking carefully about the physical 
situation) to introduce the axiom that y maps equivalent preparation pro¬ 
cedures (in the sense of III, D (1.1) into equivalent preparation procedures. 
We may easily have reservations about such an axiom. Here we shall not, 
however, consider a general setting for such a combination of two preparation 
procedures. Instead, we shall only consider a subset n c of n where n c contains 
(intuitively speaking) only those pairs (a ls a 2 ) of preparation procedures 
which can be combined in scattering experiments, that is, where the prepared 
microsystems will interact (see Figure 37) and therefore will not include 
pairs (see Figure 38) for which the microsystems are unable to interact. 

Our next problem is that of discovering axioms from physical intuition 
about the connection between q>y(a u a 2 ) and <p(ai), <p(a 2 ) for the (a u a 2 ) e n c . 

We will now present some of these considerations: To this purpose we 
shall consider the special case in which the preparation procedure a x 
prepares systems of type (1), a 2 prepares systems of type (2), that is, <p(a x ) 
and <p(a 2 ) are nonzero only for components of type (1) and (2), respectively. 
For this situation we write (not quite rigorously)^!) = W % and (p{a 2 ) = W 2 
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Figure 38 


where W 1 and W 2 are operators in the Hilbert space 3% for (1) and 3% for (2), 
respectively. 

According to VIII, §2 the effects of systems (1) in the Hilbert space 
= 3% x 3% of the composite systems will be described by F 1 x 1; 
similarly, for systems (2) will be described by 1 x F 2 (and, in the Schrodinger 
or interaction picture this form is time independent (see X, §§2 and 3)). 

If F 1 and F 2 are decision effects E x and E 2 , respectively (it suffices when at 
least one is a decision effect), then the effect that both E x and E 2 respond 
together is uniquely determined as the product (E x x 1)(1 x E 2 ) = E x x E 2 
(see IV, §1.3). 

If the registration apparatus which registers the effects E x x 1,1 x E 2 and 
E x x E 2 is placed with respect to the preparation apparatus y(a x , a 2 ) that 
the interaction of the systems (1) and (2) with the registration apparatus is 
such that, for all practical purposes, the interaction between (1) and (2) has 
not yet occurred, then from experience we may conclude that, in physical 
approximation, for W = (p(a) where a = y(a x , a 2 ) 

tr (W(E X x E 2 )) = tr (W(E X x 1)) tr(IF(l x E 2 ). (1.5) 

Here we say that the response of the two effects (E x x 1) and (1 x E 2 ) are 
“statistically independent” of each other, that is, the conditional probability 
for the response of (1 x E 2 ) provided that (E x x 1) has responded is the same 
as the probability for the response of (1 x E 2 ) without any condition: 

A(a r\b u a nb x n b 2 ) = l(anf) 0) an b 2 ), (1.6) 

where ^(ho^i) = E x x 1, il/(b 0 , b 2 ) = 1 x E 2 . From the coexistence of 
i j/(b 0 , b x ), ip(b 0 , b 2 ) and il/(b 0 , b x n b 2 ) (see IV, Th. 1.3.4) it follows that the 
relation 

i Kb 0 , b ! n b 2 ) = i//(b 0 , 6 1 )iA(h 0 > b 2 ) = E x x E 2 
holds. Thus, from (AS 2.3 in II, §3) it is easy to see that 

A(a r\b 0 ,a nb x n b 2 ) = A(a nb 0 ,a n b^^a nb lt a nb x n b 2 ) 
from which we conclude that (1.5) and (1.6) are equivalent. 
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We now define the following reduction operators R 1 and R 2 which will be 
of considerable importance later. F t -> F y x 1 defines a .^-continuous 
effect morphism L(J^) L(J% x 3%) (see V, D4.2.1 and VIII (2.2)). There¬ 
fore there exists a mi-morphism J K(J% x 3%) where 7\ = R\ (for 

the notion of a mi-morphism see V, D 4.1.1). R t is called the reduction 
operator of the composite system onto subsystem (1). By analogy with Rj 
R 2 is defined as the reduction operator onto subsystem (2). 

If we require that (1.5) holds for all E u E 2 , then for W 2 = R X W, W 2 = 
r 2 w, W' = W x x W 2 it follows that 

tr((W - W'XE,. x £ 2 )) = 0. 

For the special case in which E x = P^, E 2 = P^, cj) = cpi// it follows that 
P v x P^ = and <(/>, (W - W")^) = 0. If we replace by <p x + <p 2 and 
<Pi + i^^ndi^byi^i + i^ 2 and^ 1 + ii^ 2 it follows that <<j6',( IF — W')(f>} = 
0 for all ft = and (j> = (pi)/. Since vectors of the form (pi// span the entire 
Hilbert space, we obtain W — W' = 0, that is: 

W — Wi x W 2 . (1.7) 

In the Schrodinger picture, near the time of preparation the ensemble is 
given by (1.7) where W = <p(a) = (py(a u a 2 ). Thus we are now able to formu¬ 
late our first postulate: 

Postulate 1. There exists a time 7] for which (at least in physical 
approximation) 

W Ti = W lTt x W 2Ti 
holds (in the Schrodinger picture). 

We shall later return to the question: Which is the topology for which 
W Ti is supposed to be “approximately equal” to W 1Tt x W 2Ti in the case in 
which Postulate 1 is not applicable at a fixed time 7] but (as is often assumed 
for reasons of producing a simpler mathematical formulation) in the limit 

7J->-oo? 

Postulate 1 expresses the experimental fact that systems (1) and (2) are 
generated by means of two “separate” preparation procedures a 1 , a 2 ; that is, 
the preparation procedure a = >■(«!, a 2 ) for a scattering experiment is 
composed of two separate preparation procedures a u a 2 . It is now necessary 
to formulate, in mathematical terms, the idea that both systems are “shot at 
each other”, as intuitively illustrated by Figure 37. First we wish to exclude the 
case which is intuitively illustrated by Figure 38. First we require (both for 
the case of Figure 37 and Figure 38) that the preparation procedures are so 
chosen that at the designated time 7] the prepared systems (1) and (2) are so 
widely separated that the interaction Hj is, for all practical purposes, zero, 
that is, 

tr (W T Hj) ~ 0. (1.8) 

We could require that equality holds in (1.8), but, for mathematical reasons, 
such an assignment is awkward. It is preferable to require that the interaction 
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Hj is such that for large separations r 12 of the two systems H, is, for all 
practical purposes, approximately (but not equal to) zero. 

We could at this point introduce a second postulate using (1.8), but we shall 
first take an additional step because equation (1.8) does not express the 
condition that the systems are “colliding” (as in Figure 37) rather than 
“separating from each other” (as in Figure 38). 

In the situation described in Figure 37 the systems may (they need not 
necessarily) interact at times t > T t , that is, (1.8) need not hold for t > T t or 

tr~ 0 for t > 7] (1.9) 

need not be satisfied. In the situation of Figure 38 (1.9) remains valid for 
times t > 7j. This formulation of the distinction between these two cases is 
unsatisfactory because in both cases (1.9) is satisfied for t sufficiently large 
(that is, for t -> oo) (see §2). Therefore it is preferable to consider times t < 7]. 
Then the situations described by Figures 37 and 38 are reversed if we con¬ 
sider the time evolution in the direction t -> — oo. For t < T t (this is possible 
only in the case of gedanken experiments—see the considerations of time 
translation in VII, §§1 and 2 and the transformation into the Schrodinger 
picture given in X, §2) the situation of the “colliding systems” of Figure 37 
results in the condition that 

tr(W,Hj) ~ 0 for t < 1} (1.10) 

since (intuitively) the systems before time T t are more separated than at 
time T t . Now we come to the formulation of the second postulate: 

Postulate 2. There exists a time T t such that (at least in physical approxi¬ 
mation) the following condition is satisfied: 

tr (WHj) ~ 0 for t < 7], (1.11) 

For the notion of “physical approximation” see the remarks at the end of the 
formulation of Postulate 1. 

Since the mathematical requirement of equality in Postulates 1 and 2 is 
impossible (that is, leads to contradictions) we shall now reformulate both 
postulates, imposing postulates only in the limit as t -*• — oo. Then, for 
experiments, practical questions remain to establish finite values of time t in 
individual cases (in the laboratory scale!) for which the postulates are satisfied 
in “physical approximation”. 

For the formulation of the physical problem presented here it is meaningless 
to investigate the limit of W, for t ->■ — oo. We can show that in the case in 
which the spectrum of H is continuous that tv(W t A) -> 0 for t -> + oo (such 
limits are considered in ergodic theory). Physically we are less interested in 
the limit of W t as t -> — oo than we are in the question of whether it is possible 
to satisfy (1.11). Certainly we can investigate the limit for tr (W,Hj) for 
t -> — oo, but it is somewhat impractical. Instead of considering the operator 
(not necessarily bounded) Hj we shall consider the time evolution of W t in 
the Schrodinger picture: 

W t = e~ iHt W 0 e iHt . (1.12) 
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Since (1.11) holds for t < T t we should expect that in the following equation 
(which follows from (1.12)) for t < 7] 

W t — ll) W r 

we may approximately replace H by H 0 in the experiment, that is, we obtain 

e iH 0 ty^ e -iH 0 t _ e iH 0 t e ~iH(t-T,)^ £ iH(t-T t ) e ~ iH 0 t 

~ e iH « T ‘W Ti e' in ° T ' 

and therefore find that for t < T t the operator 

e iHot W t e~ iHot (1.13) 

is, for all practical purposes, time independent. Equation (1.13) is nothing 
other than the operator W, from (1.4a) in the interaction picture. 

It is therefore reasonable to mathematically express Postulate 2 by using 
(1.4b) by means of the following limit: 

Coll. 2. For t -> — oo for W = (py(a u a 2 ) the following limit exists 

W ( = V,' W 0 v, = e ia °’e- iHt W 0 e iH, e- iH °’ -> W l 0 (1.14) 

in the o(j%, <2>) topology where W‘ 0 e K. 

With the help of Coll. 2, Postulate 1 may be reformulated in a more 
satisfying way as follows: Since for t = 7] we obtain 

W r , ~ W‘ 0 , that is, W r , ~ e iHoT ‘W Ti e- iHoT ', 

where W Ti satisfies Postulate 1 and since e iH °‘ leaves the product representa¬ 
tion of an operator invariant, we may replace Postulate 1 as follows: 

Coll. 1 The operator Wq defined according (1.14) has the form 

W l 0 = W\ o x W‘ 2 o (1.15) 

W\ 0 and W‘ 20 are identical to the operators q)(a l )eK(M' l ) and <p(a 2 )eX(jf 2 ) 
of the Heisenberg picture of systems (1) and (2), as Q' -+ K was defined 
initially! (See III and X.) 

Coll. 1 and 2 do not, of course, guarantee that there exist any W, which 
satisfy (1.14) and (1.15). Certainly this will depend strongly on Hj. A portion 
of the discussion in this chapter is dedicated to this problem. 

We therefore consider Coll. 1 and 2 to be valid axioms for preparation 
procedures describing certain “scattering experiments”, as we have described 
above. 

Coll. 1 is nothing other than the introduction of an assertion about the 
ensembles of prepared pairs with respect to effects of systems (1) and (2), 
respectively, that is, with respect to the product representation t x 

of the Hilbert space which, according to VIII, §2, refers to the maps T { [\ T { 2 \ 
In the formulation of Coll. 1 we have assumed that the maps Tf, Tf which 
have the meaning given in VIII, §2 exist “before” the scattering. 
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Often we find that the limit (1.14) exists not only in the a(28, 2>) topology 
but also in the norm topology of 28. If W t converges in the norm topology 
then it follows that W' 0 e K. Whether (1.14) converges in the norm topology 
depends upon whether Hj decreases sufficiently rapidly with the separation 
r 12 of the two systems. For the case of Coulomb interaction which behaves 
like l/r 12 norm convergence does not occur. In this book we shall not consider 
this case further; the reader is referred to [18]. 

The norm in 28 is the trace norm, which we denote by || • • • || s . We therefore 
obtain 


l|W,-W* o ||,->0 for t -> — oo 
from which it also follows that 

II — W\\\ s -> 0 for f ->• — oo, 

where 


W\ = e~ iHot W i 0 e iHot 

because for unitary operators U the trace norm is invariant 

ML= \\uau + \\ s . 

Using Coll. 1 and using the form (1.2) of H 0 , that is, for 
H 0 = H 01 x 1 + 1 x H 02 




“ 2,1, 


it follows that 

Wi = W[, x W‘ 2t , 

where 


(1.16) 

(1.17) 

(1.18) 


(1.19) 


( 1 . 20 ) 


W\, = e- iHoi, W\ 0 e iHoit , W\ t = e- iHo2t W i 2 oe iHo2 ‘. 


2 General Properties of Effects Used in 
Scattering Experiments 

Behind the scattering zone, in a region in which after the scattering systems 

(1) and (2) are no longer in interaction the systems encounter detectors 
(counters, etc.) which register the outgoing systems (1) and/or (2). Therefore 
effects (such as those defined in III, D 1.2) will be registered. What is the 
special structure of these effects which are registered after the scattering? 

Here we do not need to introduce any new structure in the theory because 
in VIII, §2 we have already introduced the appropriate structure which 
permits us to characterize the effects produced separately by systems (1) and 

(2) . Here we must only answer the question of what we mean by the expression 
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“after the scattering” and provide a mathematical description using previously 
introduced structures. That is, we wish to develop a mathematical formula¬ 
tion for the experimental situation (using the theory presented earlier in this 
book) in which the registration apparatus is so constructed and positioned 
that systems (1) and (2) will interact with the registration apparatus only 
after no further interaction between (1) and (2) will occur. This means that 
we may compute the probabilities for the activation of the effect responses 
for the ensemble W t (in the Schrodinger picture) for times t in which the 
interaction between (1) and (2) is no longer active. 

The conditions required on W t in order to compute the effect responses 
“after scattering” are therefore completely similar to that imposed on W t 
before the scattering. While Coll. 2 requires a specific behavior for t -* — oo, 
we must therefore assume (!) that for t -*■ oo there exists a limit as in (1.14) 
for which 

W t =V;W 0 V,^W{, (2.1) 

provided that W t also satisfies the conditions Coll. 1 and Coll. 2. If such an 
assumption is not made, it is not possible to easily use a scattering theory 
such as presented here. If (2.1) is not satisfied then the simplifications of 
scattering theory presented here are somewhat unrealistic. In fact it is 
necessary to make modifications in the case in which we use composite 
systems (such as atoms) instead of elementary systems. 

If scattering theory is applicable, then for a preparation according to a 
given preparation procedure a = y(a u a 2 ) the corresponding W\ defined in 
(1.8) will be approximately equal to the correct W t (of (1.12)) for — oo < t< 7]. 
The time 7] will, of course, depend on the particular preparation procedure a, 
that is, upon W\ . From time 7] to time 7} the quantity W t may be significantly 
different from W\ and from 

W{ = e~ iHo, W f 0 e iH ° l . (2.2) 

For t > T f ,W t will be approximately equal to W{. Clearly 7} will of course 
also depend upon the prepared W\. 

Using the Schrodinger picture we have sought to derive, in an intuitive 
way, the postulates Coll. 1 and Coll. 2 and the assumption (2.1). Here the 
time parameter t plays an important role. In fact the use of the limits t -* ± oo 
is only an intuitive tool. The actual times for the preparations and registra¬ 
tions clearly are not represented by t -* — oo and t -> oo, respectively. Since 
the difference 7} — 7] are, in practice, very small (a fraction of a second) it is 
understandable that the experimental physicists are not concerned whether 
the preparation occurred before T t and the registration occurred after T f . 
They “believe” that the spatial arrangement of the preparation and registra¬ 
tion apparatus guarantees the condition that W‘ 0 is determined by a = 
y(a u a 2 ) and that the ensemble W{, may be used to compute the frequency 
distribution for the registered effects. Mathematically this problem is a 
problem about the relationship between the asymptotic behavior as 
t-* ± oo and the asymptotic behavior for large spatial separations. This 
question will be discussed in §4.7. 
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In summary we may therefore say that the simplification of scattering 
theory consists first in the fact that we may easily obtain, on the basis of the 
construction of the preparation apparatus in the form a = y(a 1 , a 2 ) and the 
arrangement of the two parts a u a 2 , a mathematical expression for 
W l 0 if (/)(«!) and (p(a 2 ) are known and second, that for the computation of the 
probability for the registered effects we need only use W{. Thus we may use 
either an “undisturbed Schrodinger picture” or an “undisturbed Heisenberg 
picture” for the investigation of the registered effects in the following way: 
For the composite system for an effect procedure ( b 0 , b) there corresponds an 
operator F = ijj(b 0 , b ), and the probability is to be computed for an exact 
W = W 0 (see X, §2) using tr( W 0 F). If F was registered after the scattering 
(which is experimentally guaranteed by the spatial arrangement of the 
registration apparatus corresponding to b 0 relative to the preparation 
apparatus corresponding to a) then tr (W 0 F) x Xx{W f 0 F) will be a very good 
approximation. If we translate the effect procedure by a time r then we will 
obtain the approximations 



tr{W 0 e iHt Fe~ iHx ) x tr (W{,e iHot Fe^ iH ° x ) 

(2.3) 

and 




tr(W 0 e iHt Fe~ iHz ) = ix{W t F) x tr (W f F\ 

(2.4) 

where 




W{ = e~ iH ° x W{,e iHot . 


Equations (2.3) and (2.4) hold for all r > 0; for r < 0 they will, of course, be 
valid only if the effects e iHoz Fe~ iH ° x displaced by t satisfy the assumption that 
the registration occurs only after the scattering, or, in experimental situations, 
what amounts to the same thing, that T t b 0 may be combined with a in the 
sense of III, §1 because the spatial arrangement (far from the interaction area 
of the two systems) of b 0 to a guarantees that either T t b 0 is always registering 
after the scattering or cannot be combined with a (see III, §1 and [6], XVI 
and [7], III, §5). For an experimental physicist there is no such problem 
because intuitively, no registration procedure will be used until the prepara¬ 
tion apparatus is “turned on”. 


3 Separation of Center of Mass Motion 


Since the interaction Hj is translation invariant, the motion of the center of 
mass can be separated (see VIII, §2). Here we will provide a list of the formulas 
which are necessary for scattering theory. 

In the Hamiltonian operator (VIII (2.12)) 


"-2 


Pl + H, 


(3.1) 
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we shall, according to VIII (2.9) and VIII (2.18), replace the operators Q x 
and Q 2 by new operators as follows 


and 


- = mjQx + m 2 Q 2 
m 1 + m 2 


Qr — Q\ ~~ 


(3.2) 

(3.3) 


where Q is the position of the Center of mass and Q r is the “relative position”. 
We shall now obtain the canonically conjugate momentum corresponding 
to Q and Q r . According to VIII (2.8) and VIII (2.13) the latter are given by 


P = P 1 +P 


!■> 


r Af 1 ’ 


(3.4) 


where M = m x + m 2 ■ Defining the reduced mass m by the equation 


it follows that 


I_ J_ 1 M 

m m 2 m 1 m 2 



(3.5) 


(3.6) 


Defining the “velocity’’^operators by (\jm x )P x — V 1 and (l/Vn 2 )P^= V 2 
we find that (1 /m)P r = - V 2 has the form of the velocity^ vector V of an 
“elementary system” with reduced mass m corresponding to g^The reduced 
mass may also be obtained if we rewrite (l/2m 1 )Pi + (1/2 m 2 )P\ using P, P r 
(see VIII (2.14)); we find that 

H-^.P 2 + Pp; + H,. (3.7) 


According to VIII (1.2) and VIII (2.17) we may write with V — Hj 


where 


and 


H = H b + Hi = H 0 + Hj, 

Hi = H i + V, H t = ±- P 2 
2m 

H 0 = H b + H h H b = —j— P 2 


(3.8) 

(3-9) 

(3.10) 
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In the sense of the product representation VIII (1.1) we therefore obtain 



U, = e iHt = e iHbt x e iHil = U[ h) x if 

(3.11) 

and 

U, = e iHot = e iHbt x e iA ‘ { = U[ b) x Uf\ 

(3.12) 

where 

U[ b) = e iHbt , 

(3.13) 


U \ 0 = e iH,t , 

(3.14) 

and 

lJf = e iA '‘. 

(3.15) 


4 Wave Operators and the Scattering Operator 

In this section we shall be concerned with the mathematical question of 
whether the limits considered in §§1 and 2 exist, and how the “jump” from 
W' 0 to W{, is to be calculated. We can only present a brief insight into a 
mathematically well-developed theory. We refer readers who are interested 
in the mathematical structure of the theory to [10] and [18]. 


4.1 Definition of the Wave Operators 

Our first problem is to investigate the limits considered in (1.14) and (2.1) as 
t-* ± oo. We shall consider both cases t— oo and t -> oo at the same time, 
and we shall distinguish between them by using the indices —, + (and 
eventually i, /). Multiple indices appearing in the same equation mean that 
the formula is valid for both indices (—, i) and ( + ,/). We shall here in¬ 
vestigate the stronger condition of convergence in the trace norm topology: 

II Ut W 0 U t — l/, + Wfr'UfWs 0 for t -» ± co. (4.1.1) 

Since the trace norm is invariant under unitary transformations, we may 
rewrite (4.1.1) as follows: 

WWo-V.Wk'Vfl^O (4.1.2) 

or 

\W t + w 0 v t - wtf‘L-0 

for t -> ±oo where V t = UUf. 

The behavior of the operator 

V, = U t Ut~ e iHt e~ iH °‘ (4.1.3) 
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is of crucial importance for the validity of relationships of the form (4.1.2). 
This operator V, is, as we have already mentioned in §1, the operator which 
describes the time dependence of the ensemble in the interaction picture. 
From (4.1.3) and (3.11), (3.12) it follows that: 


V t — (e ,Hbt x e lH,, )(e lHb ' x e lU,T ), 
V, = 1 x e iH '‘e- i6 '‘. 


(4.1.4) 


Equation (4.1.4) means that in the computation of V, we need only take 
into account relative motion in the center of mass system for the Hamiltonian 
operator H t = H, + Hj. In 3tf b (from the representation of x 

acts like the unit operator. With respect to the product representation 
x according to (4.1.4) we obtain 

V t = 1 x Q(t) with Q(t) = (4.1.5) 


This condition permits us to restrict our discussion and proofs to con¬ 
sideration of fi(f). 

Conditions (4.1.2) are satisfied if, in the sense of the strong topology (that 
is, A„ -» A for A n (p -> A(p for all cp e Jf) the following equation holds 

Q(t) -> Q ± for t -* ±oo (4.1.6) 


a fact which will be proven in Th. 4.2.2, 

In order not to excessively restrict our investigation we shall only require 
that fi(t) converges only in a subspace of .Af t ; then, since Q(t) is unitary it 
easily follows that the set of (p e ^ for which fi(r)<p converges is a subspace of 
Thus, we may write (somewhat more generally than as in (4.1.6): 

Q(t)P± -> Q ± for t -> ±oo, (4.1.7) 

where P ± is the projection operator onto the subspace where the limit of 
Q(t) exists for t ± oo. 

In order to make the formulas in the next sections more tractible we shall 
omit the indices i for H t and H t . Then we may, according to (4.1.5), write 

Q(£) = e iHt e~ iAt . (4.1.8) 

In order to make the dependence on H, H more noticeable it is desirable to 
rewrite (4.1.7) using (4.1.8) as follows: 

lim Q.(t)P ± = Cl ± (H,H). (4.1.9) 


Q ± are called the “wave operators” or “Mpller operators”. 

The projection onto the subspace upon which 0 + (t) is convergent will be 
denoted by Q ± . In the notation of (4.1.9) we therefore obtain 

fi(t)P+ -»• £I + (fl, H) for t -» ± oo, 

. (4.1.10) 

^ + (06+ H) for t -> ± oo. 





228 XVI Scattering Theory 


In physics instead of 1 x H (as an operator in M b x M t ) it is common to 
write Q; similarly, for 1 x P ± and 1 x Q+ it is common to write P ± and 
Q +. In this sense then (4.1.10) holds also if Q(t) is replaced by V t . 


4.2 Some General Properties of Wave Operators 

In order that we need not deal with unnecessarily many indices, we shall 
abbreviate (4.1.9) as follows: 

Q(t)P -> Q (4.2.1) 

from which it follows that QP = Q. Since Q (t) is unitary, for tpePM it 
follows that 


M = IIO(0?II - ll^ll, that is, ||Q<p|| = Mi¬ 
ll is therefore an isometric mapping of PM onto a subspace of M ; we shall 
let E denote the projection onto this subspace. Thus from (AIV, §7) it follows 
that 

Q + H = P and f2Q + = E. (4.2.2) 


From 


||Q(t)<p - cp || = ||cp - Q + (t)«p|| 

it follows that Q + (t) converges in EM'. In addition, for <p e PM (since q> = 
Qcp, that is, cp = Q~(p) we find that Q~(t)E converges to Q + £. 

For Q defined by Equation (4.1.10) we find that Q + (t)Q converges, that is, 
Q ' (t) converges in all of QM. Therefore Q > E. If Q # £ then Q(t) would 
converge in a larger space than PM = Cl + EM. Therefore Q = £, and we 
obtain 

Cl + (t)Q -*• Q + and QQ + = Q. (4.2.3) 

Thus we have proven the following theorem: 

Th. 4.2.1. Let Q(t) be a unitary operator, and let P be the projection operator 
onto the convergence space of Q(t). Then the limit Q = lim[Q(t)E] is an 
isometric mapping of PM onto a subspace QM. In addition QP = Q, 
Q + f2 = P,QQ + = Q, Q + Q = Q + .QM is the convergence space of Q(t) and 
Q + (t)Q-+Q + . 

Th. 4.2.2. Let W > 0, tx(W) = 1 and PWP — W. Then in the trace-norm 
topology Q(t)IFf2 + (0 -> W where W > 0 and tv(W) = 1, and W is given 
by W — QWQ = QWQ + . For each W satisfying QWQ the following 
condition is satisfied: Q + (f)lFQ(f) -> Q + IFQ. 
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Proof. We first consider the special case in which W = P v . Since PP V P = P v we 
obtain Ptp = (p and we therefore obtain SMlpp -> cp = Qcp. Since £2 is isometric, we 
obtain ||<p|| = 1. For (p(t) = Q (t)(p we obtain £2(t)P^£2 + (t) = P v(t) . We shall now 
compute HP*,, - PJ S . 

The eigenvalues for the operator P v(l) — P 9 may be obtained from the following 
equation (here we write q> as an abbreviation for <p(t)): 

PJ ~ P-J = <K<Z>,/> ~ <K0,/> = M- 

If we set (p = (p(pp, q>y + r (therefore 1 = | (Cp, <p) 2 + ||r|| 2 ) then, for the pair of 
nonzero eigenvalues A 1i2 = + ||r|| and we therefore obtain || P w) — P^|| s = 2||r||. 
Since ||r|| is the shortest distance from <p(t) to the line Xq>, we obtain 

WO - 011 > llrll 

and we therefore obtain 


II P m - PJs < 2W0 - 0||. 

Since \\<p(t) — <p|| -*Owe therefore obtain 

||Q(t)P„n + (t) - P,||, - 0. 

From cp = Q(p it follows that P 9 = £2P^£2 + . 

For arbitrary W satisfying W = PWP it follows that 

W=^P Vv , 

V 

where w v > 0, w v = 1 and P<p v — (p v . For 0(t)<Pv = <p v (0 : <P,(t) -*■ <p v = £2<j»v For 
W = Xv w,P iv we obtain W > 0, tr(#) = 1, W = Q.WQ + and 


||£2(t)IFfi + (t) - W ||, 


E Wv(^ v „) PipJ 


=£ E w vilP«, v(II - p„x 

V 

N oo 

^ E W vl|f«. v (0 - P*J\s + 2 X W v 

»=1 v = N +1 

Since w v = h we may first choose N and then choose t sufficiently large in order 
to obtain 

\\Q(t)WSl + (t) - W\\ s < e. 


Th. 4.2.2 shows that in (4.1.2) Wfr 1 = W 0 Q. ± and W 0 = Q ± W£‘Clt. 

Thus(4.1.2) is satisfied provided that P ± W{,’‘P± = Wfr ', which is equivalent 
to Q ± W 0 Q ± = W 0 . According to III, §6, using the notation in III, D 3.1, the 
condition PWP = W is equivalent to the condition WsK 0 {\ — P) = K X (P). 
Thus W -> £2IT£2 + is a bijective affine map of K t (P) onto KJQ) and W -* 
£2 + WQ. is the corresponding inverse map. 

From W 0 = VFqQI it follows that the following equation holds in the 
Schrodinger picture for all t: 

W t = e- iH ‘Sl_ 
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With the help of (1.15) we obtain 

W, = e- iH, Q_(W[ o x W^Ql e iHt . (4.2.4) 

Therefore (4.2.4) is applicable providing that P~(W\ 0 x W 20 )P+ = 
W\q x W l 20 . Later we will find that “normally” P_ = 1. 

Therefore Q_(JTi 0 x W 20 )d- is the correct operator in the Heisen¬ 
berg picture, that is, for <p(a,) — W\ 0 , (p{a 2 ) = W 20 we obtain <py(a u a 2 ) = 
Q_{W\ o x W l 20 )Q.Z. The correct operator in the Schrodinger picture is 
given by (4.2.4). Both may therefore be used for the computation for the 
probabilities for arbitrary effects, regardless of the time at which the registra¬ 
tion method is applied. 

Th. 4.2.3. If Q(t) = e iHt e ~ iHt then P&P is an invariant subspace of the group 
e lHt and is an invariant subspaceof the group e lHt . Then the following 
relationships are satisfied: Pe lllt = e'^'P, Qe lHt = e‘ H, Q, Qe l, *‘ = e lHt Q and 

n + e iHt = e i 6 t n + . 

Proof. From Q(t + t) = e iHt e iH, e~ , ^ , e~‘^ t it follows that f2(t + t) = 
and we therefore obtain 

0(t + t )e iA ’P = e im Q(t)P. (4.2.5) 

The right side converges with t (for fixed r) towards e iHz Q. The left side must 
therefore also converge; thus e^'P must lie in PjP, that is, 

Pe iAx P = e iAz P (4.2.6) 

must be satisfied for all t. Thus it follows that PJf is an invariant subspace for the 
group e iAz . If we take the adjoint operator corresponding to (4.2.6) and replace 
t by — t, we then obtain 

e ifh P = Pe if,z . (4.2.7) 

With the aid of (4.2.7) equation (4.2.5) is transformed into 
Q (t + T)Pe ith = e i,h Q(l)P 


from which we conclude that, for fixed t, in the limits t -* oo and t -> — oo 

Oe i6z = e iHc £l (4.2.8) 

We may obtain equivalent relationships if we replace H by H 0 , P by Q and Q by £T. 

If Pi < P and If (as P) commutes with e if,t and if we require that Q x be 
the projector onto Q P x 2tf, it then follows that Qi < Q. We therefore obtain 
fi+QFi = and HQ+Qi = Qj. 

From (4.2.8) it follows that 

Qe iih P l = e iHz QP l = e iHx QSlP u 
Q.e itlz If = QIfe i,h = Q&e i&t P x = Q x e int QIf. 
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Since ClPiJ'F = Q V FF, e iHx leaves the subspace Q x invariant, and we 

therefore obtain 

e iHt Qi = Q,e iH \ 

Thus we have proven the following theorem: 

Th. 4.2.4. //Pi < P and Pi commutes with e'‘° n (for all t ) (which, for the 
case in which Pj = P is satisfied according to Th. 4.2.3) and if Qi is the 
projection on QP ) ,2f', then it follows that Qi < Q and Qi#f is invariant with 
respect to e lHt and it follows that e ,Ht Q l = Qie lHt . 

Th. 4.2.5. For P 1 < P and Qi < Q defined as in Th. 4.2.4, the following 
converge in the strong topology: 


(a) 

e iHt e -i' h P a, 

e iAt e~ iHt Q-* Q + 

(b) 

e- iH, n - e dh P 0, 

e i6, e- iHt Cl-* P; 

(c) 

o' 

t 

ft, 

°5 

r 

/-—s 

1 

a 

(Q + - \)e~ ib, P -> 0; 

(d) 

(QPj - l)e ^'P, ->0, 

(Q + Qi~ l)e~ iH, Pi -* 0; 

(e) 

e^Qe-un p 

e ib, Q + e~ i6t P->P; 

(0 

^ &t QP l e~ at -*• Pi, 

e i6t n + Qie~ iA, Pi -*• P], 


For a projector Q 2 > Q u that is, for a Q 2 satisfying Q 2 QP { = QP, the 
following convergence properties are satisfied: 

(g) (1 - Q 2 )e- i6t Px 0, e i6t Q 2 e~ itl, P l P v 

Proof, (a) follows directly from (4.2.1) and (4.2.3). The relations (a) can be multiplied 
on the left (not necessarily on the right!) by a uniformly bounded operator A(t). (b) 
then follows from the first relation of (a) by multiplication from the left by e~ m ' and 
then by e 

If we then use the equation (4.2.8) in the first relation of (b), then we obtain the first 
relation of (c) since Q ~ QP. If we multiply this first relation on the left by Q + , then, 
using (4.2.2) we obtain: 

(Q + Q - Q. + )e~ ift, P = (P - Q - )e~ iA, P 

= (P - Q + )Pe~ i6 ‘ = (1 - Q + )Pe~‘”‘ 

= (1 - Q + )e- iS, P-0. 

All relations (b) and (c) can be multiplied on the right by P ,; the same is true for the 
first relations from (a). Thus we may multiply the second relation from (a) on the 
right by Q v The relations obtained in the above manner from the relations (a) and (c), 
are so simple that we shall not take note of them. 

If we multiply the first relations in (c) on the right by P u we obtain 

(Q - I )e- lfl, Pi = (Q - l)P 1 e- A 

= (QP, - l)Pe ie “ = (QP] - l)e un P l -> 0. 
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If we then multiply on the left by 0. + Q lt then from Q* Q,QP , = Q'QP, = P, we 
obtain 

(Pi - n + Q 1 )e~ i6, P 1 = (P, - * 

= (1 - a*Q l )p l e~ at = (1 - a + Q l )e~ A P 1 - 0. 

Thus we have proven the two relations (d). 

If we multiply the relations (c) and (d) on the left by we then obtain the 
relations (e) and (f). 

If we then multiply the first relation of (d) on the left by a Q 2 > Qi (that is, a Q 2 
satisfying Q 2 QPi = flPi), we then obtain: 

(OP, - Q 2 )e- l6, P i - 0. 

If we subtract this relation from the first relation of (d), we obtain the first relation 
of (g). If we multiply this first relation of (g) on the left by e‘^', we obtain the second 
relation of (g). 

Th. 4.2.6. Let Q (H, H), P and Q be defined as before; similarly, let Q,(H U H ), 
P and Q. Suppose that P, (where < P) commute with e lllt (then, according 

to Th. 4.2.4, Qx = QP 1 Q + commutes with e lHt ) and let Q x commute with P. 
Then PQ t = Q 2 will be a projector which commutes with e lHt . For P 2 = 
Q + Q 2 Q we obtain P 2 < P,, and, according to Th. 4.2.4 P 2 commutes with 
e i6t . 

Then 

Q(Hx, H)Q(H, H)Pi = Q(H U H)P 2 . 

In particular, e lH,, e ~ l '° ,< converges in P 2 .2f. 

Proof. Since e iH "e~ m ‘P -> iXH,, H),sinceQ l e iH, e~ i ^ , P 1 -> Q,Q(H, A) = Q (H,A)P 1 
and since the left-hand side of these two relations are uniformly bounded in norm, 
we obtain 

e ‘H , t g - mtpQ i e Mt e - ifhp , h)Q(H, llyP,. 

Using P 2 defined above, from PQ, = Q 2 it follows that 

Q 2 e iH, e~ i6, P 2 - Q 2 e iH, e~ i6, P l 

-» Q 2 0.(H, H)P 2 - Q 2 Q(H, H)P t = 0 

and we obtain 

e lH ' , e- iH, Q 2 e iH, e-‘ A, P 2 - Q(H U H)Q(H, H)P,. 

Since Q z commutes with e iHt it follows that 

e iH "Q 2 e~ i ‘' n P 2 - Q( H u H)Q(H, H)P,. 

Since Q 2 QP 2 = S1P 2 , from the first equation of (g) Th. 4.2.5 it follows that 

(1 - Q 2 )e~ i6t P 2 0 

and we therefore obtain 

e iHl ‘(I - Qi)e~ iA ‘P 2 -> 0. 
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Finally, we obtain 

e iH 1 i e -ifHp 2 _ a{H ^ h)P 1 . 

Therefore we find that e iH,l e u,t P 2 is convergent. Therefore, in the limit, we obtain 

i, H)P 2 = Q(f/„ //)£!(//, ft)P v 


Th. 4.2.6 may be applied to the following special case in which P x is defined 
as follows: First, we define Q x = P a Q; then Q x commutes with e iHt since 
P2P and are invariant subspaces with respect to e iHt . Let P x = 
then P x commutes with e iHt . Since PQ X = Q x , we obtain P 2 = P x , and with 
this P x we therefore obtain Cl(H x , H)P X = Q (H x , H)Q(H, H)P X . 

In the special case in which P commutes with Q, then we may set P x = P 
and we obtain P 2 = Q PQQ, and we obtain 

n(Hi, H)P 2 = Cl(Hy fl), 

where we have already used the relation: 

Q (H, H)P = Q(H, H). 

4.3 Wave Operators and the Spectral Representation of the 
Hamiltonian Operators 

In §4.2 we have discussed the general properties of the wave operator which 
follow directly from its definition. We will now consider additional properties 
of the wave operator which are related to the spectral representation of the 
Hamiltonian operator. For this purpose we shall use the theorem (proven in 
AIV, §10; see also VII, §2) that H (and similarly H) may be represented as 
follows 

*00 *00 

H = sdE(e), e iHt = e ia dE(&), (4.3.1) 

J — 00 J — oo 

whereby £(e) is uniquely determined by the operator H. 

From (4.3.1) it follows that, for cpeSi(H) (where 2(H) is the domain of 
definition for H; see, AIV, §10), the following limit exists: 

4- e iHt (p = e ilIt H<p = He m <p. 
at 

Thus it follows that the spectral family E(e ) is also uniquely determined by 
the operators e iHt (for all t !). If a bounded operator B commutes with all the 
e iHt , then for cp e 2(H), it follows from 

Be m (p = e iHt B(p 

that the left (and therefore also the right) side is differentiable, from which 
we conclude that Bq> e 2(H) and that BH = HB. Thus, together with the 
results from AIV, §§8 and 10, it follows that: 

e ,Ht B = Be lHt (for all t) is equivalent to E(e)B = BE(s) (for all s). (4.3.2) 
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Thus it follows that B commutes with each E from the complete Boolean ring 
generated by the spectral family (IV, §1.3). 

Examples of such operators B are given by the operators P ± ,Q ± defined 
in §4.1. P ± , Q ± commute with the spectral family of H and H, respectively. 
From (4.2.8), using (4.2.3) we obtain 

Qe iAt Q + = e iH, QQ. + = e iHt Q. (4.3.3) 

Let £(e), £(e) denote the spectral families of H and H, respectively. Let 
E(e) = Q E(e)Q + = QE(e)PQ + 

then E(e ) defines a spectral family in the space Qjf because Q is an isometric 
mapping of PJf onto Qjf and E(c)P is a spectral family in PJf (P commutes 
with all E(r) !). 

According to equation (4.3.3) we obtain 

| e ia dE(e) = J e ia d(E(s)Q), J s dE(s) = f e d(E(s)Q). 

Since the spectral family is uniquely determined (See AIV, §10), we obtain 

fi£(s)Q + = QE(e)PSl + = E(s)Q (4.3.4) 

(since E(s) commutes with Q, E(e)Q is clearly a spectral family!). Thus, from 
(4.3.4), we obtain the following theorem: 

Th. 4.3.1. The spectral families E(e)P in P#f and E{e)Q in are unitarily 
equivalent. 

According to IV, §2.5 and IX, §§2 and 3 E( s) defines a “spectral measure” 
on the set of real numbers which divides the set of real numbers into the set 
k d of the discrete spectrum, k cc the absolutely continuous spectrum, k sc the 
singular continuous spectrum and k 0 where the latter is the complement of 
k d u k cc u k sc = Sp (H) (the sets k d , k cc , k sc are disjoint). We shall denote the 
projectors E(k d ), E{k cc ), E(k sc ) by P d , P cc , P sc , respectively. Clearly P cc + 

Psc + Pi =1. 

We shall denote the corresponding quantities for the spectral family 
£(e) by an additional index ° as follows: P d , P cc , P sc , respectively. 

Each spectral measure E(k) may be decomposed into an absolutely con¬ 
tinuous part E cc (k) = E(k n k cc ) = P cc E(k), a singular continuous part 
E Jk) == E(k n k sc ) = P sc E(k) and a discrete part E d (k) = E(k n k d ) = 
P d E(k) as follows: 

E(k) = E(k n k cc ) + E(k n k sc ) + E(k n k d ). 

The projections P cc , P sc , P d are uniquely determined (see IV, §2.5) because 
P cc consists of all vectors (p in Jf for which |j£(fc)^o|| 2 is absolutely con¬ 
tinuous with respect to Lebesgue measure; P d consists of all vectors <p in 
for which the function ||£(e)<p|| 2 is constant except for jump discon¬ 
tinuities ; P sc consists of all vectors q> in for which || E(e)(p || 2 is continuous 
but \\E(k)(p\\ 2 is singular with respect to Lebesgue measure. 
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We shall now use the following abbreviated notation: 

P i = LP, Pi = P*P, P 3 = P*P 
and 

Q i = QP 1 n + , Q 2 = np 2 n + , q 3 = qp 3 q + . 

Then, since P cc + P sc + P d = 1 , we obtain 

P1 + P2 + P3 — P, 61 + 62 + 63 — 6- 
Thus it follows that 

Cl&Jk) Q + = QEJk)P l Q + = E(k)Q 1 

and therefore that E(k)Q i is absoluely continuous (because for each 
( peQ £(fc) 6 i<P = O.E cc (k)Q + (p is absolutely continuous), that is, 
Qi > P cc - Therefore since 

£(/c)< 2 i = £(£)£« 61 = E cc (k)Qi 

it follows that 

QE cc (k)n + = EJM t . 

In a similar manner we obtain 0 2 < P* c and 0 3 < P d . Thus, multiplying 
6 = 6 i + 62 + 63 by P cc it follows that 

QiPcc = 6i = ^6- 
Therefore we may also write 

QEJk)Q- = EJk)Q. (4.3.5) 

D 4.3.1. A wave operator Q (H, H) is said to be normal if P cc < P, that is, 
fi(H, H) is defined at least on P cc . 

In Th. 4.2.5 we may therefore consider the special case in which P t = P cc 
and Qx = P CC Q. From (4.3.5) it then follows that 

Th. 4.3.2. //Q is normal, then the spectrum of El in P CC Q ( that is, the 
absolutely continuous part of the spectrum of El which lies in Qyf ) is 
unitarily equivalent to the absolutely continuous spectrum of 6. 

D 4.3.2. If Q (H, Pi) and il(H, El) are normal, that is, P > P cc and Q > P cc 
then we say that Q (H, H) is complete. 

For a complete operator Q, equation (4.3.5) therefore holds in the special 
form 

nEJk)a + = EJk), (4.3.6) 

that is, 
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Th. 4.3.3. If Cl is complete, then the absolutely continuous parts of the spectrum 
of H and 11 are unitarily equivalent. 

Th. 4.2.6 can easily be formulated for the special case of normal operators 
as follows: 

Th. 4.3.4. If Cl(H, H) and Q(//j, H) are normal operators, then the following 
equation is satisfied: 

CKH U H)Cl(H, H)P CC = Cl(H u H)P CC . 

Proof. Since Cl(H, H) is normal, then P cc < P, and we may therefore set P 1 = P cc in 
Th. 4.2.6. Then <2, = < P cc . Since £2(7/,, H) is normal, P > P cc > Q i and 

therefore P commutes with <2 1 • Thus we obtain Q 2 = PQ = Q, and we finally obtain 

P 2 = Pi = P cc . 

Th. 4.3.5. If Cl(H, H) and Cl(H u H) are complete, then Cl(H u H) is also 
complete and satisfies the equation 

CKHu H)P CC = Q(Ih, H)Cl(H, H)P CC . 

Proof. Since Cl(H, H) and Q(ifi, H) are complete, Q(//, H) and Q(H, H ,) are normal, 
and we may use Th. 4.3.4. We obtain 

Cl + (H U H)P Uc = Q(H, H,)P Uc = Q(H, H)Q(H, H t )P Uc 

thus the domain of £2 {A, H j) is larger than P lcc ^f and Q(Pi, /I,)is therefore complete. 


4.4 The S Operator 

Here we shall only consider the S operator for the case in which both £2 + and 
Q . are complete wave operators. According to Th. 4.2.2 for each W satisfying 
P CC WP CC = W it follows that Q(f)tFQ(f) -> W = CI+ WCl^ f° r t~* ±°° 
where tV = P CC WP CC . The same situation also holds for each W for which 
P cc WP CC = W: Cl ft)WCl(t) -> Qt WQ ± for t-+ ±oo. According to (4.1.2) 
we may replace W by Wfr 1 (and W by W 0 ) in the above relations. For 
satisfying P cc Wf l P cc = W{, J we therefore obtain 


W f 0 -‘Qi = W 0 

(4.4.1a) 

Qj;iFoH ± = W{’\ 

(4.4.1b) 


At the end of §2 we noted that the “jump from W‘ 0 to W{” is of crucial 
importance in scattering theory. From (4.4.1b) it follows that 

W f o = 
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If we substitute the following relation (obtained from (4.4.1a)) 

W 0 = 

into the above, we obtain 

W f 0 = 

D4.4.1. The operator S = QjQ ~P CC is called the “Scattering Operator” 
or the S operator. 

Since we have assumed that Q ± is complete, S is therefore a unitary 
operator which maps P cc into itself. If P cc = 1 then S would be a unitary 
operator in all of If Q ± is not complete, then Q ± could be different and 
we must carry out investigations which are similar to those carried out in 
Th. 4.2.6 in order to describe the behavior of the operator Q+Q _. We therefore 
see that it is an important problem to determine the conditions under which 
complete wave operators exist. 

For the case of a complete wave operator we may, with the help of the S 
operator, write 

W f 0 = SW l 0 S + . (4.4.2) 

According to the discussions at the end of §2 we may therefore describe the 
scattering process as follows: Before time T t (which of course, depends upon 
W l 0 ), that is, for t < T t we may replace W t in the Schrodinger picture by 

W t ~ <T id 'WV d '. (4.4.3) 

For t > T t the expression (4.4.3) for W t is not correct. Since, however, W t is 
prepared for t < T„ the knowledge of the expression (4.4.3) is sufficient for 
the description of the preparation. 

The “error” caused by the use of e iHt instead of e iHt in the time evolution 
of W t is corrected by assuming that “at time t = 0” W makes a sudden tran¬ 
sition from W l 0 to W{ according to (4.4.2), and that afterwards, the time 
evolution is computed with e'^\ whereupon for t > T f we obtain the correct 
W t as follows: 

W t ~ e - iAt W f 0 e iAt = e-^’SW'oS+e 16 '. (4.4.4) 

Using (4.4.4) we may compute the probabilities of “effects F registered after 
the scattering” as follows: 

tT(e- i6, SW i 0 S + e iAt F) (4.4.5a) 

(4.4.5a) holds in the Schrodinger picture, where F is assumed to be constant 
in time. In the Heisenberg picture F changes under the time translation of the 
effect apparatus by time r according to the expression 

F z = e if,t Fe~ ilh 
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and the probability is computed according to the equation 

tr (SW^S+FJ, (4.4.5b) 

where this result is valid only for values of t for which the approximation 
of the “free Heisenberg picture” is valid. 

In the special case in which F x is an effect of system (1) (or (2)) then we 
replace F t by F r x 1 (or 1 x F r ) in (4.4.5b) where the product representation 
of jr=Mf’ 1 xje 2 is the same as that before the scattering (see §1)! The fact 
that this situation is not changed is obtained from the transformation 
property of W l 0 in StV l 0 S + . If we fix the ensemble, that is, if we rewrite (4.4.5) 
in the form 

tr (W" 0 f t ) with F t = S + F z S (4.4.5c) 

then, in the sense of VIII, §2, the product representation is transformed by S, 
that is, the maps T[ f) , T ( / ] defined in VIII, §2 are obtained from Tf, Tf as 
follows: 

T\ f) g i = S + (rp gi )S, HP0 2 = S + (Tfg 2 )S, 

respectively, where 

gqeLi, g 2 e L 2 . 

Th. 4.4.1. S commutes with 
Proof. From Th. 4.2.3 it follows that 

Se if “ = Sl + + Sl_P cc e i6 ‘ = Sl + + Sl_e i6, P cc 

= Q z e iH, Q.P cc = e i6t nla_P cc = e ,6 ‘S. 

According to Th. 4.4.1 equation (4.4.5) can be rewritten as follows: 

tr (Se~ iA, W i 0 e iA, S + F) = tr(e~ ifl, W i 0 e u ° , ‘S"FS). (4.4.6) 

From Th. 4.4.1 it follows that the jump from W‘ 0 to W{, can take place, with 
the aid of the S operator, not only “at time t = 0” but also at any time r, 
because, according to (4.4.4), for t > 7}, we obtain 

W, ~ e^ i6, SW i 0 S + e i6t = e~ iA(, ~ T) Se~ i§T W l 0 e il)z S + e iA(t ~ 

Thus there is no particular time point “zero” for the application of the 
scattering operator, but only the two times T t and T f (where T t and 7} depend 
upon Wp), so that (4.4.3) can be used for t < 7} and (4.4.4) can be used for 
t > T f . Therefore it is not physically meaningful to speak of a “time point of 
scattering”. 

4.5 A Sufficient Condition for the Existence of 
Normal Wave Operators 

According to the discussion of the previous section, it is possible to determine 
the structure of a “scattering theory” providing that the existence of the S 
operator is assured. This, in turn, depends upon the existence of complete 
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wave operators. According to D4.3.2 a wave operator Q.(H, H) is complete if 
both Q (H, H) and Q(H, H ) are normal. Thus, all of the above points to the 
question of finding conditions which insure that a wave operator Q(JT, H) 
(or Q (H, Hj) is normal. This is therefore the central question of scattering 
theory, a problem to be discussed following the discussion of the “formal” 
properties of the wave operator. 

In this section we shall provide sufficient conditions for the existence of 
normal wave operators and we shall provide a number of interesting con¬ 
clusions. The following theorems will be formulated somewhat generally; 
they will, for the most part, only be used to show that a wave operator is 
normal. 

Th. 4.5.1. Let P, as before, denote the projection onto the domain of definition 
of □(//, H). To a projection P 1 there may exist a fundamental set X from 
P (that is, the closed subspace [X] generated by X is P 1 J^f ; see AIV, §2) 

which satisfies; the following properties: For each tpe X there exists a x 
for which e~ l ^‘(p e S>(H) n S>(H) for t > x (here 3>{H) is the domain of 
definition for the Hermitian operator H—see AIV, §10) and Ve" 1 ^' is con¬ 
tinuous in t for t > x and [| Ve^- l '"(p\\ is integrable over tfrom x to co (here 
we have used the abbreviation H — H = V where V = Hj was defined in 
§3). We then obtain P x < P. If the assumptions hold for P } = P cc , then 
Q.(H, H) is normal; if the same is also true for P 1 = P cc and if we exchange 
H and H (and replace V by { — F)), then Cl(H, H) will be complete. 

Proof. From <p e X and from e ifn <p e ®(/f) r\ 2(H) and Sl(t)(p = e iHt e~ if “(p, 
according to §4.3 it follows that 

- Q(t)<p = ie m \H - H)e- ie, '<p = ie iH ‘Ve ^‘(p. (4.5.1) 

dt 

Since Ve~‘^‘ was continuous in t, this is also the case for the right-hand side of (4.5.1); 
we therefore obtain 

J % t 2 

e‘ H, Ve~ i6, (p dt. (4.5.2) 

h 

Thus it follows that 

m 2 )<p - OfiM < r II Ve- iA, cp\\dt. 

i 

Since, by hypothesis, W e ~ iA,( P\\ is integrable from x to oo, the right-hand side will 
be arbitrarily small providing that t 1 ,t 2 are chosen sufficiently large. Therefore Q.(t)cp 
converges for t -> oo : 

Q(t)(p -> Q.(p, 

that is cp e PJf. Since this is true for all <peX, £l(t)(p converges for all <p in the space 
spanned by X (see §4.1) and therefore for all (p e P v yf. Therefore P > P t . 

With the aid of Th. 4.5.1 for certain “interactions” V it may be proven that 
Q(H, H) is normal. Corresponding to §3, we set H = H i and II = H,-according 
to (3.9) and (3.10). 
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It is easy to see that the operator H t = ( l/2m)P 2 has an absolutely con¬ 
tinuous spectrurp for e > 0, and we obtain P cc = 1. A fundamental set X for 
is given by the set of all vectors (with different a): 

cp(r -a) = (4.5.3) 

In order to eliminate unnecessary numerical factors, we shall set m = j, in 
order that H = — A. Then e^ ilh (p(r — a) describes the “force free” motion 
of a wave packet, the position of which at time t — 0 is given by (p(r — a). 
According to the discussion at the end of X, §1 

e~ iA ‘(p(? - a) = (1 + 2itr 3/ V (l? - S|2)/[2(1+2i,)] . (4.5.4) 

We may also consider the case in which the particles have spin. The Hilbert 
space XCi for the relative motion is, according to VIII (2.23), given by 
= 2(' rb x where ^ = t ls x t 2s . Here M) b is the “relative orbit space” 
for which the vectors (4.5.3) form a fundamental set and is the finite¬ 
dimensional (!) spin space of the particle pair. A fundamental set for :2(\ is 
obtained from the set of all vectors of the form 

(p v (r - a) = e~ (lt ~ al2)/2 u v , (4.5.5) 

where the u v (v = l,.... n) form a basis for #C S . Thus (4.5.4) is simply multi¬ 
plied by the w v . 

The interaction operator V can be written as a matrix operator V vfl where 
the V v/1 act as operators in Jf rh as follows: 

Ve~ im <p v (r - 3) = £ (V MV e- iAt <p(r - 5))u„. (4.5.6) 

We shall at h rst assume that (as is already implicit in (4.5.6)) that wave 
packed of the form (4.5.4) belong to the domain of definition of the operators 
V llv . Thus they are elements of 2(H) n 2(H). 

From (4.5.6) it follows that (assuming (hat the are orthonormal): 

\\Ve~ iAt (p v (r - a )|| 2 = £ f| Kv e ~ iAt (p(r ~ «)l 2 d ^ (4.5.7) 

H J 


Since, for arbitrary b fl > 0 (£„ b^ 2 > 6 2 , we find that || Ve ojj — a)|| 

is integrable over all t from t = — oo to oo provided that the 


||F„ ve -^W-a)|i 





-iftt 


cp(r — a )| 2 d 


V 


11/2 


(4.5.8) 


are integrable over all t. Now that we have seen that, due to the finite di¬ 
mensionality of M' s , that it is “mathematically” sufficient to consider only the 
case without spin, we shall omit the indices in (4.5.8) in order to simplify the 
notation. 

We shall now assume that the operator V is of the form 



(4.5.9) 
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Then, for 

t l/,(r) = Ve~ lfl, (p(r — a) 
and from (4.5.4) it follows that 


iWr) = 


From 


fo(r)(l + 2 it) 3/2 - (1 + 2 it) 5/2 X p »(r)( x » - «v) 

v= < 

_-(|?-a| 2 )/[2(l+2i()l 


= (1 + 2it)“ 3l2 v 0 (r)e ~ (l? ~ 3| 2)/[2< 1 + 2iI)1 , 

V, 2) = (1 + 2it)~~ 512 Y V v(?)( x v ~ a v )e _( l ?_ “l 2)/[2(1 + 2i ')], 


it follows that 

ll<M < ll<A! 1) ll + W 2) ll- 

For arbitrary e, where 0 < e < 1 it follows that 

t/ 0 (f) \r — a| <1_6>/2 


tAJ 0 = (1 + 2 it)- 1 ' 


e/2 . 


|r — 2| (1 ~' i>/2 (1 + 2if) (1 ~ e>/2 


-(|?-«P)/[2(l + 2ti)] 


X u v (r)(x v - a v ) 


l /'( 2> = (1 + 2it) _1 


- 1 -e/2 v 


-ap- 


0/2 


p-ap- £)/2 (1 + 2it) (3 


- e )/2 


,-(|f-a|)/[2(l + 2 it)] 


Since y (1_£)/2 e _ 5,2/2 < c, and }’ (3 ' :,/2 e ^ ,2/2 < c 2 (for real v > 0) where c, and 
c 2 are suitable constants, we may estimate (where A > 0 may be chosen 
arbitrarily): 


(1) for \r — a| > A 


\^t l) (r)\ < (1 + 4t 2 ) 


2 \- 1 / 2 — e/4 « 


l»o(r)l 


|^< 2) (r)| < (1 +4t 2 ) 


-1/2-e/4. 


Ir-ap-^ 2 ’ 

VX kODi 2 


“ 2 p-ai (, - £ > /2 ’ 


(2) for | r — a | < A we simply obtain 

l<TOl<(l +4t 2 )- 3/4 |t;o(r)l, 


|^ 2) (r)l < (1 + 4t 2 )- 5/4 |r - 3| ^XPv^l 2 
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Thus it follows that 



dt 


exists if there exists an e satisfying 0 < £ < 1 for which the integrals 

3 



, I k(r)l 2 

j\r — a\>A I? ^1 


a\<A 


|r - a| 2 £ \v v (r)\ 2 d 3 r 


(4.5.10) 


exist. The first conditions from (4.5.10) state that the functions v 0 (r), v v (r) 
must decrease more quickly, as |r| -> oo than |r|“ l . The Coulomb potential 
does not fall within this case, a fact which is not mathematically pleasant. This 
fact is, however, not of particular physical importance since there is no 
experiment for which we encounter arbitrarily large values of |r|, that is, 
distances between the scattered particles. It is frequently useful to multiply 
the functions v 0 (r) and t\(r) by a factor where ot" 1 corresponds to the 

maximum distance of the microsystems in actual experiments, oT‘ is 
therefore an experimentally determined macroscopic distance. For the 
mathematical problem of the limits t -> +oo in the case | v(r) | ~ | r |'~ 1 we 
refer readers to [18]. 


Th, 4.5.2. If V is bounded, then for each <p e PW' we obtain 

||Q<p — Q(t)(p\\ 2 = 2 Im {(p, e iJ ^Q + Ve~ iflz (p ) dx. 

Proof. Since V is bounded, 2(H) = 2(H) and, therefore for <p e 2(H) we 
obtain e 2(H) n 2(fi). In addition Ve~ i ^ , cp is continuous in t, and (4.5.2) 

therefore holds for each (p e 2(H). We obtain 

r t2 

Q(t 2 )<p — £2(t jfrp = i e iH, Ve~ l ^‘(p dt. 

•'ll 

Since 2(H) is dense in and V is bounded, this equation holds for all q> e Jt. In 
particular, for <p e P2C, for t 2 -* oo we obtain 

f oo 

e lHz Ve~‘^ T (p dx. 


Thus from 

||n<p - £2(t)<p|| 2 = ||£2(p|| 2 + |[fl(t)(p|| 2 

- 2 Re<£2<p, £l(t)(p} = 21| (p || 2 - 2 Re<fi<p, ft(t)<p> 
= 2 Re<ft(jo, (2<p — ti(t)<p) 
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it follows that 


||Q<p — f2(t)<p|| 2 = 2 Reji <<p, £l + e lHl Ve~ l ^ x ( t o'} dtj 

= —2 Im (<p, e‘ ,,x Q + Ve~ l ^ x (p} dx, 


where we have used Th. 4.2.3. 


4.6 The Existence of Complete Wave Operators 

In this section we shall seek to find sufficient conditions for V in order to 
insure completeness of f2 (H, //). To this purpose we shall now prove the 
following theorem: 

Th. 4.6.1. If V = cP f (||/|| = 1), then Q±(H, H) will be complete. 

We shall carry out the proof of this theorem in several steps, in which we 
first prove a special case of Th. 4.6.1. 


Case (a). 


Here P = 1. 


= J£ 2 ( — oo, oo), H(p(x ) = x(p(x). 


Proof. Clearly S)(H) = @(H). We therefore use Th. 4.5.1. From 

pOO _ 

Ve~ ifl, <p(x) = cf(x) f(x')e~ ix '‘<p(x’) dx' 
j - 00 


it follows that 


\\Ve- m '<p(x)\\ = |c| 


/•OO 

f(x)e~‘ x '(p(x) dx . 


We now consider only functions q>(x) and / (x) which are twice differentiable and 
for x -* +oo decrease towards zero sufficiently fast that the Fourier integral can be 
integrated by parts twice 

/•OO 1 poo 1 poo 

e~ ix ‘g(x) dx = : e ixt g'(x) dx = — 5 e ix, g"(x) dx. 

J- oo ItJ- oc (lt> J-ao 

Then it follows that \\Ve~‘^‘(p\\ is integrable for both t -> oo and t -> — oc . Such 
functions (p(x) are dense in .Jf and therefore for £1(H, H),P— 1. 

Since C2 exists, according to Th. 4.5.2, for all cpeJf we obtain 

poo 

||Q<p - Q(t)<p|| 2 = -2 1m J <rp, e i6x Q + Ve~ ifi » dr 

/•OO 

= 2c Im J <<p, e i6x Q + fXf e~'%> dr (4.6.1) 

< 2|c|[ J" |< 4 >, e i6x Cl + f}\ 2 dxj' 2 [ J" |</ f c-'S>| 2 drj 1/2 . 
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This estimate is meaningful only if the integrals over t in the last row exist. To show 
this we may, in general, estimate 


f \<g,e i Hy\ 2 dx 



|2 


e ixt h(x)g(x) dx 


dx 


/» 00 

= 2 n \h(x)g(x)\ 2 dx < 2ng 2 \\h\\ 2 , (4.6.2) 

J - 00 

where, for the Fourier integral we have used the following general result: From 
u(k) = ,— f e ikx v(x) dx 

y/2n J-oo 


it follows that 




|y(x)| 2 dx. 


Here g s is the effective supremum of g(x). In (4.6.2) we may exchange the estimates of 
g and h. The integrals over t therefore exist if one of the two functions g and h (not 
only belongs to Jzf 2 ( — oo, oo)) but is essentially bounded. 

From (4.6.1) for an essentially bounded <p(x) with (p s (since l|fi‘/ || < ||/|| = 1!) 
it follows that 


l|£2<Z> - 0(t)v>l! ^ 2|c|(27c) 1/2 <j9 s 


r 


</,e !iit (py\ 2 dx\ 


11/2 


From which it follows that 


\\0.(t 2 )(p - II < \\Q(t 2 )(p - Qq>\\ + || G.(p - Q(ti)^|| 


< 


(87t) 1/4 |c| 1/2 (p s 1/2 | ^\<f,e-^cpy\ 2 


dx 


1/4 


+ 


[f 


!</> e~ iHt cpy\ 2 dx 


' 1/4 


(4.6.3) 


This inequality holds, according to Th. 4.5.2, for all q oeJtf’ (since P = 1!) with finite 
(p s ; it is only proven for such f(x ) which were twice differentiable and vanish 
sufficiently quickly for x -> + oo. The right side of (4.6.3) is defined for all f e 
a result which follows from (4.6.2) for finite q> s . We are therefore led to the conjecture 
that (4.6.3) holds for all / e Jdf —here we note that Q(t) also depends upon /! 

If f e JP, then there exists a sequence /„ (||/J = 1) which satisfies || /„ — f\\ -*• 0 
for which (4.6.3) holds for each /„. We set H„ = A + cP Jn and Q„(f) = e iHn ‘e~^ u . We 
then obtain 

\\H n - H\\ = \c\\\P fn - P f \\ <2\c\\\f n - f\\. 


We will now show that ||I2„(t) — fl(/)j| -*• 0: 

We obtain 

_ e iHt e -i6t || = || e i H n t _ e iHt | = _ J|| 





4 Wave Operators and the Scattering Operator 245 

For a 1 1/ e @(/7„) = 2t(H) we obtain 

4 {e iHnt e~ iH, \ji) = ie iHn ‘(H„ - 
at 

and we therefore find that 

-i p = i f e iHn \H„ - H)e~ iHt il/ dx. 

Vo 

This equation holds for all te X since the operators which are found both on the 
left and right side are bounded. Therefore for || ^ || = 1 we obtain 

\\e iHn, e~ iH, il/ - n < f' ||(H. - H)e- iH 'n dx < f \\H n - H\\ dx = t\\H n - JJ||. 
Jo Jo 

Thus from 

£Xt 2 )(p - 0(£i)<p = (Q.(t 2 )<P - 0„(t 2 )<p) 

+ n(h)<P ~ 0„(£ ] >P) + (0„(t,>p - Q(t t )(p) 

we obtain 

\\&n(t2)<P ~ &n(tl)<P\\ -+ 

For an integral on the right side of (4.6.3) it follows from the triangle inequality that 

r*oo ~11/2 r /»oo o ~11/2 

J \<fn,e~ t6x q>}\ 2 dx^ - |J |</ e~ iH *(py\ 2 dtj 

< | </„ - /. e~ i6t (py | 2 rfrj < (27f) 1/2 ||/„ - f\\<p s . 

Thus (4.6.3) is proven for all /e Jtf’ (||/1| = 1). From (4.6.3) and (4.6.2) it follows that 
Q(t)tp converges for all tp ex’ with finite <p s . Since these q> are dense in JC, we have 
proven that P — 1 in Case (a). 

Case (b). H cc is unitarily equivalent to the operator H defined in Case (a). 
V = cP f for arbitrary f eJf. We obtain P > P cc . 

Proof. We set / = g + h where g = P cc f Then for a cp e P cc : we obtain 

\\Ve- th ‘(p\\ = |c||</ e~ l6, P cc (py\ = \c\KP cc fe- A <py\ = \c l<<?, 

Since H in P CC J/P is unitarily equivalent to the Pi in (a), equation (4.6.3) holds as before 
with g instead of / and for (peP cc JH with finite <p s . Thus it follows that P > P cc . 

Case (c). Let S be a Lebesgue measurable set on the reals. Let = PP 2 (S); 
H(p(x) = x(p(x), H = H + cP f . Then P = 1. 

Proof. We will shortly find that A has only an absolutely continuous spectrum. 
Jf” can be considered to be a subspace of H? 2 ( — oo, oo) as follows: 

= J5? 2 (S) = P s Se 2 {- oo, oo) 
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where P s x(x) = (z(x)for x e S; 0 for x$S}. Let Pt'x( x ) = x%(x) for x(x)ei? 2 (— oo, co). 
Therefore P = P s H'P S . We set H' = ft' + cP' f as an operator in i? 2 ( —oo, co) for 
an f e P S JP 2 ( — oo, oo). Then the limit 

Q C p = lim £2(0<p = lim 

exists for all <p e SP 2 (—co, oo), and therefore for all <p e For <p — P s q> it follows 
that 

Q<p = lim e iH ^ itr, P s (p = lim e iH , P s e~ i6, P s (p 

since e~‘^ ' commutes with P s and e i( ' r ‘ is identical to e^‘ in Jtf. Since P s f = f the 
operator e iH t does not lead us out of JK Therefore we obtain 

e iHt P s = P s e m P s - 

Therefore in sXP we obtain 

Q.cp = lim e iH, e~^“<p. 

Case (d). H cc is unitarily equivalent with an operator H, as we have assumed 
in (c). H = ft + cP f . Then P > P cc . 

Proof. JP = P CC JXP © ftP cc = ft cc . As in (c) we extend the space P CC JP to the 
space <£\- oo, oo) and consider the operators ft' and H' = ft' + cP f . As in (b) we 
decompose f = g + h where h e #P r and g e £P 2 (— oo, oo). Then g is an element of 
the subspace P cc #C of £P 2 ( — oo, oo). According to (b) the limit of Q(t) exists in all of 
i? 2 ( — oo, oo) whereby, according to (c) Q(t) transforms the subspace P CC #P of 
«S? 2 (— oo, oo) into itself since g e P cc .TP. 

Case (e). H 0 is arbitrary, H = ft + cP f . Then P > P cc . 

Proof. Let 3tf 0 denote the subspace of spanned by all the 6(e) f (— oo < e < oo) (see 

IX, §1). Thus f eJ>P 0 ; for each “spectral measurable” function <j>(s) for which 

^\4>(e)\ 2 d\\E(B)f\\ 2 < x, 


it follows that 


g = j <j>(e) dE(e)f e 3f 0 

J^ 0 is therefore an invariant subspace of e‘ f ", but also of e‘"‘ since f e TP 0 . 

We set = 3tP 0 © Then is therefore also an invariant subspace of e‘^‘ 
and e tH ‘ and therefore of Sl(t). Since f eJiP 0 , it follows that (see above (c)) e iHt % = e i ®‘x 
for all x £ and we therefore obtain Si(t)x = X- For P, as the projection onto 
■Jfi P[ commutes with P cc and we therefore obtain 

lim fl(t)Pi^c = ^Pcc P l = Pcc P i- 

In order to prove that lim Sl(t)P cc exists, it is therefore only necessary to show that 
the following limit exists: 

Sl(t)P cc (l - Pr), 

that is, that lim Q(t)P cc (p exists for all <pe 


Putie. TfegiAniH aLtmi 



4 Wave Operators and the Scattering Operator 247 


We therefore need only consider the operators e iHl and e‘"‘ in 3%, . We shall denote 
these operators by e iH '‘ and e ifr ‘. P cc { 1 - P t )tf = P cc 3^ 0 is then the subspace of the 
absolutely continuous part of A', since P cc and (1 — Pi) commute. We will show that 
A' cc is unitarily equivalent to the operator A given in (c), and, according to (d) 
lim Sl(t)P cc (p exists for all <p e ,3f 0 . 

3% is spanned by the vectors £(«)./• Let / = g + h where g = P cc f. Then for E(e)g 
we obtain 

P cc £(e)g = £(e)P cc 9 = E((.)g. 

The P(s)g and £(s)h span two mutually orthogonal subspaces of 3% which together 
span all of 3%, . Each vector of the subspace spanned by E 0 (s)g is absolutely continuous, 
while the vectors from the subspace spanned by the E 0 (e)h are orthogonal to P cc 3€. 
The vectors £(s)g therefore span P cc Jf 0 . 

All the vectors (peP cc 3^ 0 may therefore be represented (see IX, §2) by functions 
4>(s) which are defined on a Lebesgue measurable set S from — oo < s < oo for which 

f \p(s)\ 2 de < oo, 

4s 

where H 0 <j>(e ) = &p(&). Therefore the problem posed above corresponds to case (d) 
above. 

We now prove Th. 4.6.1 as follows: 

For Q(H, A) we therefore obtain P > P cc . Since ft + = Q(A, H) we may apply (e) and 
we obtain Q >P CC ■ Thus Sl(H, A) is complete. 

Th. 4.6.2. If V = £" = i then 0(7/, H) is complete. 

Proof. Let = A + £? =1 c v P Vv - Then H = H„ and H„ +1 = + c„ +1 P^ +1 . It 

therefore follows that all the Q(A I1+1 , //„) are complete. 

By finitely many applications of Th. 4.3.5 it follows directly that Q (H, A) is 
complete. 


Th. 4.6.3. Let H = J e dE(e) and <p e P CC JP. For 

„ „ 2 J d\\E(e)(pf 

Ml = ess sup%-— 

it follows that for all p e 

p 00 

K</6 e~ iHt (p}\ 2 dt < 2n\\(p\\ 

J ~ 00 


Proof. We begin with 


<i l/,e~‘ H, (py= e dip, E(e)<py. 

" — 00 


Since <p is absolutely continuous, ip, E(e)(p} is absolutely continuous; we therefore 
obtain 


dip, E(e)(p} 


dip, E{e)(py 

-7- de 

de 
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from which it follows that 

J-00 

If d(ip, E(e)(py/de is quadratically integrable with respect to e, it follows directly (as 
in (4.6.1)) that 

r ,<*. ,-«.»>!■ *-2 ,r 

J — OO J — 00 I ds 

For i/>' = P cc ip (since <p is absolutely continuous) it follows that 

(tp, E(e)(py = <$', E(e)(p} = <£(e) )ip', E(e)q>y. 

Thus, for each measurable set k we obtain 

<*, E(k)(py = <mw, m<py 

it therefore follows that 

\<*P> E(k)(py\ 2 < \\E(k)ip'\\ 2 \\E(k)q>\\ 2 . 

Since ||£(fe)^'|| 2 is also absolutely continuous, it therefore follows that 

I r d(ip, E{e)cpy ± 2 ^ r d\\E(em 2 dc MI|F(£)ffl[| 2 ± 

j ds Jjj ds Jk d& 

Since the above result is valid for every measurable set k, it follows that (almost 
everywhere) 

d(ip, E(e)cpy 2 ' d\\E(e)iP'\\ 2 d\\E(eM 2 
ds ~ de de 

From which we conclude that 

= Ms w\\ 2 < imi 2 \m 2 - 

Th. 4.6.4. If V is a trace class operator, then Q(H, H) is complete. 

(From Th. 4.6.4 it follows, according to Th. 4.3.3, that the absolute con¬ 
tinuous parts of H and H are unitarily equivalent.) 

Proof. We need only show that 0(H, 6) is normal, then the same is true if we replace 
H with fit for 0.(6, H ) = 0 + (H, 6) and it immediately follows that 0(H, H) is 
complete. 

We have K= £"=1 c v P Vv where |c v | < co. Let V„ = £? =1 c v F„ v , H„ = 
6 + V„, Q„(t) — e iHnt e~ lf “ and let 0„ denote the limit of 0„(t). 

According to Th. 4.6.2 the Q„ are normal. According to Th. 4.5.2, for all <p e P cc #f 
we obtain 

II O n cp - n„(t)^H 2 = -2 Im C (<p, e A n:V n e-^<pydx. 
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For 


<9, e^n;v„e-%y= £ c v <p, 

v= 1 

it follows that (using Th. 4.6.3) 


I«.9» - vfOvW 2 < 2 

8 [X| II 

1_J 

*00 

\<<p, e iAx at<py\ 2 d% 

t 


00 

J>-'J 

*oo ~i 

\(<p v ,e~ iflx (py\ 2 dx 1 

< 2(2rt) 1/2 ||<p|| 

/ oo \ 1/2 

i £ k v |) 


00 1 

Z kvl 
_v= 1 J 

\v= 1 / 

*00 ~1 

\f(p„e~ iflx (py\ 2 dx\ 


Thus we obtain the following inequality 

mt 2 )<p - OJttM < (8tt) 1/4 ll<pL 1/2 ||F|| s 1/4 [r(t 2 , <p) m + r(t u «p) 1/4 ] 
with trace norm 

nn. = ikj, 

V 

where we used the following abbreviated notation: 

r(t,<p)= £ |c v | f \f<p v ,e~ i6x <p}\ 2 dx. 

V= 1 Jt 

Since H — H„ + (V - V„) and \\V - VJ 0 it follows that (as shown above) 
|| e‘ Ht — e iHn, \\ -> 0 and 10(0 — 0„(t)|| -*■ 0. Thus it follows that 

m 2 )<P - ^(0)^11 < (87r) 1/4 ||^||i /2 || F|| s 1/4 [r(t 2 , <p) 1/4 + r(t u <p) 1/4 ]. 

Since 

r(t,<p)<2n\\(pU\V\\ s 

it therefore follows that r(t, <p) -> 0 for t -> oo. Thus lim O (t) exists for all <p e P cc #f 
having finite ||<p|| s . It only remains to show that the set of all <p with finite ||<p|| s is 
dense in P CC M: 

For p(e) = d | E{e)(p\\ 2 jdr we obtain j ” m p(e) de — ||<p|| 2 thus for fc„ = {e | p(e) > n }: 
E(k n )(p 0- Thus it follows that (p n = (1 — E(k„))<p -+ rp and ||p„|| 2 < n. 

Th. 4.6.5. Let H — (l + V, let V be a trace class operator and, let fife) be a 
function which satisfies the following conditions: There exists a partition of 
— oo • • • oo into finitely many intervals such that fife) is differentiable in each 
interval, fi'(s) is continuous, of finite variation in each finite sub-interval 
and fi’(e) > 0 (therefore fi’(e) # 0!). Then Q(fi(H), fi(H)) is a complete 
wave operator, and for all (p e P cc M J the following equation is satisfied: 

Q(fi(H), fi{fl))<p = Q(H, H)cp 
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Proof. We must show that 


lim gWaXg-wfoji 


exists and is equal to Q(H, H)P CC . Since is (uniformly in t) bounded, it suffices 
to show that 

e -m6) , p cc _ H)P CC 0. 

From (4.3.6) it follows that 

S1(H, ft) J d£(e)P cc = JV'** dE(e%l(H, A) 

and we therefore obtain 

H) = Si(H, 

Thus it remains to prove that 

[1 - Q(H, tiy]e-^ 6), P cc -> 0. 

It suffices to show that 

[1 - Q (H, -> 0 

holds for a dense set of vectors <p in P cc Jt. 

Using the last estimate from the proof of Th. 4.6.4, for t 2 -> oo it follows that: 

l|Ofl» - 0(0*11 £ Ml l2 (MV\\ s ) m r(t,cpyi 4 . 

For the particular case in which Q(0)<p = q> we obtain: 

11(0 - 1)<P!I < I! «>L 1/2 (87r||F|| s ) 1/4 r(0, <p ) m • 

Since 

||£(6)p|| 2 = We-^'fcM 2 

we obtain 

Ml. = \\e-‘^‘P(e)cp\\ s 


and we therefore obtain 

Il[l - S1(H, < l<p|| s 1 / 2 (87t|!Fy 1/4 r(0,e- w %) 1/4 . 

We therefore need only show that 

r(0, e- i,l,(6), (p) -* 0 

fort -» oo. We obtain: 

00 /*00 

r(0, e~ m6), (p) = £ |c v | |<<p w e _i ^ t_w *V>| 2 dr. 

v= 1 Jo 

As in the proof of Th. 4.6.3 we obtain 

<fl> v , P d/ ^^ {£)(p > dz, 

i tI p 
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where d<(j» v , £(s)<p>/de is a quadratically integrable function satisfying (since 

!l<Pv!l = 1): 


J 


d<q> y , £(e)<p> 


de 


de < 


If we show, that as t -> oo each integral 


|<«js> v , e~ ,6x i<wfl) ty>| 2 dx -> 0, 

•^0 

then we also obtain 

r(0, e- mA), (p) - 0 

since (using Th. 4.6.3) 

r(0, e-^') < f |c v | 2 tc M 2 + £ |c v | P |<<j» v , e-^-^'cp}! 2 dr. 

v = N+l v -1 Jo 

Therefore, it only remains to show that, for 



e ‘^ e >‘g(e) de 


and a quadratic integrable function g(e.) 



dr ->0 


for t 


00 . 


/( t , t) is, as a function of t, the Fourier transform of the function e~ i,He) ‘g(e). If R is the 
projection 


R/M = 



for t < 0, 
for t > 0, 


then Jo \f(t,r)\ 2 dr -> 0 is equivalent to the statement that Re'^^'g -> 0. If this 
is the case for a basis set of vectors g{e) in the Hilbert space of quadratically integrable 
functions of e, then it holds for all quadratically integrable functions g(e). It therefore 
suffices to choose g(e) to be characteristic functions for arbitrarily small intervals. 
Since the cj)(s) may be decomposed into finitely many intervals in which it is con¬ 
tinuously differentiable, the characteristic functions may be so chosen that 4>(e) is 
continuously differentiable in the intervals of the characteristic functions. For such 
a characteristic function we obtain (for r > 0 and t > 0): 


/(t, t) = J e ~ ■«-■</>(£)( de = i J (T + f ,£'(£))-» ~ (g-to-wor) de 

[ -Ux-icl>(e)nb r-b J 

I fit, t) | < (r + t^'(a))' 1 + (t + tfiibyy 1 + f \d(r + 1 \. 

Ja 


We obtain 


|(t + f<j>'(«i))~ 1 - (r + 1 ! = - 0'(Si)Xt + f^'(Si))~ V + t(t>X8 2 )r 1 1 

< t|0'(£2> - i)I(t + ct) 2 , 
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where c (> 0)!) is the minimum of </>'(fi) in the closed interval [a, 6], Therefore we 
obtain 


f 

•'a 


M(t + t(j>\e)) *|< 


Mt 

(t + ct ) 2 ’ 


where M is the total variation of </>'(<:) in the interval [a, 6]. Thus it follows that 
|/(t, t)| 2 < 2(t + ct) -1 + Mt(r + ct) 2 


< 2(t + ct )~ 1 + Me 1 (r + ct) 1 


2 c + M 
c(t + ct) 


and 


I 


|/(t,r)| 2 dr< (2c+ , M)2 -^0 


ct 


for t -* oo. 

We shall begin by choosing fl = (l/2m)P 2 and H = 6 + V where F is 
such that H is bounded from below. We see, that 

Cl(H, #) = Q(H + otl, fi + al) 

and that the statement “Q(H, H) is complete” is equivalent to the statement 
“S1(H + al, H + al) is complete”. We can choose a such that, to H a = 
H + al, = H + al there exists a d > 0 such that H x > 51 and > 51. 
Thus it follows that H ~ 1 and H ~ 1 are bounded operators. 

Th, 4.6.6. IfHis a trace class operator then Q(H, H 0 ) exists and 
is complete. 

Proof. According to Th. 4.6.4, for H' = H ~ 1 and H' = fl~ \ Q.(H', fl') exists and is 
complete. cj>(e ) = — g" 1 for s > d (with the above 5) and 4>(e) = e for e < S satisfies 
the conditions from Th. 4.6.5. Therefore Q.Uj>(H~ 1 ), ’)) = Q(—H a , —BJ is 

complete. Since —ftj = Jit follows thatfl(H 2 , HJ = Q (H,H 0 ) 

is complete. 

Now let Fhave the form assumed in §4.5 where i\,(f) = 0 in (4.5.9). For the 
sake of simplicity we will now assume that V is bounded. Then, H = H + V 
is (trivially) bounded from below. We may therefore apply Th. 4.6.6 providing 
that H~ 1 — H~ 1 is a trace class operator. 

From H x = H x + V it follows that H~ ! H rj = 1 — H~ 1 V. Since V and 
H a 1 are bounded operators, H~ l H x is also a bounded operator. Further¬ 
more it follows that 

h;' - H-' = -h;'vh;' = 

Since H~ l H< x is bounded, H~ 1 - H~ 1 is a trace class operator if H a 1 VH~ 1 
is a trace class operator. Let V + be the positive and let (— V_ ) be the negative 
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part of V. Since H x 1 is positive, ff a 1 V + H a 1 is the positive and H a 1 (— V. )H y 1 
is the negative part of It is therefore sufficient if 

tr(H“ 1 V + fl~ 1 ) < co 

and tr(H“ 1 V_ H~ ‘) < oo. This is the case if the condition 

Jtr s (K±(r)) < oo 

is satisfied where tr s is the trace in spin space (see §4.5). 

Then we find that 

tr(fl; 1 V ± H; ') = Jtr s (F ± (r)) d 3 r 

Here it is possible (and of considerable mathematical interest) to improve 
the results described above (see [10] and [18]). However, with respect to the 
physics described here, no new essential results are obtained, because the 
interaction operators between two elementary systems (satisfying VIII (5.8)) 
do not satisfy all of the conditions for the existence of a complete wave 
operator. In §§4.5 and 5.2 we have easily obtained such an operator by means 
of a “trick” in which we multiply the interaction operator Fby e~ ar (where 
a > 0 is very small). If there is a singularity at r = 0, we may modify V for 
r < very small) such that V is bounded. Such modifications must not 
change (in physical approximation) the spectrum of H nor the wave 
operators; otherwise, the Hamiltonian operator defined by VIII (5.8) must 
be considered to be physically unusable. Naturally, the mathematical problem 
which reflects the above approximation remains to be formulated. 

4.7 Stationary Scattering Theory 

Earlier, in §§4.1 to 4.6, we have outlined the problem of the wave operator 
and scattering operators in terms of “time limits”. We shall now refer to a 
number of results of the so-called stationary scattering theory without giving 
precise descriptions of the mathematical problems to be found there. This 
section shall only serve to establish the connection between this method and 
other methods. 

We shall begin with (4.5.2). Here we shall assume that the limit 
fl(t) Q- exists in all of Jf, that is, that P = P cc = 1. Then in the limit we 

obtain 

Q_ = 1 + i f e iHt Ve' i6t dt. (4.7.1) 

Jo 

Similarly, by exchanging H and H, we obtain 

/»— 00 

a* =Q - i\ e ift, Ve~ iH, Q dt, (4.7.2) 

Jo 
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where Q is defined in §4.2. If Q is complete, then Q = P cc . From Ql Q_ = 1 
and <2Q_ = Q , from (4.7.2) it follows that, by multiplication with 

= 1 + i f e iAt Ve im n_ dt. 

Jo 

From (4.2.8) it follows that 

Q_ = 1 + i f e iA, Vn_ e- iAt dt. (4.7.3) 

Jo 

Completely analogous equations may be derived for fi + . Equation (4.7.3) for 
Q_ has the form of a stationary integral equation because t is only an integra¬ 
tion variable. It is useful to multiply the integrands in (4.7.3) by a factor e m 
and then to take the limit 0 < a -> 0. We obtain 


/*— 00 

= 1 + lim i e iA, VQ.-e~ i6,+c “ dt. (4.7.4) 

a^O Jo 

Equation (4.7.4) has, as a consequence, that it often remains meaningful 
when applied to improper eigenvectors of H. 

Beginning with the improper eigenvector 

in the position representation where u e Jtf., and using the notation 

fc(r) = n_<r|*>« (4.7.5) 


then from (4.7.4) it follows that 

'I'df) = < r\k}u + lim i 

rx~*0 Jo 


0 (iti-i(,k 2 /2m)+x)t 


dtvuo 


and we therefore obtain 

iff) = <.r\k}u - lim — iajVil/dO- 


(4.7.6) 


Equation (4.7.6) is known as the Lippmann-Schwinger equation. In the 
position representation it follows that 


lim ( r 

a-0 


(H-f-i.) 1 = lim - JL f 

\ 2m ) 7 (27r) 3 J k’ 2 


e‘ 


,ik'(r' — r) 


■ ,2 /2w — k 2 /2m— ict 
This is precisely a Green’s function of the differential operator A — k 2 l: 

k 2 


d 3 k'. 


lim ( r 

a-*0 \ 




m e 




r ; = 


2n\r - ? 


Thus (4.7.6) is transformed into 

1 - m Ce ik 

-2i J>VW> dV. 


(4.7.7) 


(4.7.8) 
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where V as an operator also acts in spin space because must, according 
to (4.7.5), be considered as a vector in spin space: 

Mr) = X 'Pkv(f)u v , 

v 

where i fa v (r) 6 and the u v is a basis in 3tf s (see §4.5). 

From (4.7.8) it follows that t Mr) 1S an eigenvector of H with eigenvalue 
k 2 /2m: 

HM^) = (H + V)Ur) = ~ Mb (4-7.9) 

The above equation is only an alternative description of the relationship 
(4.3.6) making use of improper eigenvectors. 

The solution (4.7.8) of (4.7.9) has the following form: For large r = |r| we 
obtain the asymptotic solution: 

1 1 /A e ikr 

^ ~ (XrF e * ' U + (XO 571 9 n \r) (4 ' 7 ' 10) 

Equation (4.7.10) is called the “radiation condition” for the solution 
of (4.7.9). This condition states that i/^r) differs from a plane wave by the 
addition of an out-going spherical wave. The intuitive meaning of (4.7.10) 
is very clear: If we consider, in the Schrodinger picture (4.2.4) an ensemble 
Wq in Jfj of the form W l 0 = P^ where 

Hr) = ^372 | e ‘ k f X “vXvW d 3 k 

then W t will, according to (4.2.4), be equal to P lj/t where 

Mr) = f M(r) X KX v {he~ i(kll2m)t d 3 k. (4.7.1 1 ) 

J v 

For t < 1} (7- is defined in §1) we approximately obtain 

Mr) » X w vZv(fc) d 3 k, 

that is, a free moving wave packet. Upon reaching the region of interaction 
in the vicinity of \r \ = 0 spherical waves are produced. For t > 7} we again 
obtain free motion, given by 

H(r) ~ (^372 | e« n - f -« 2l2mW X «vXv(*) d 3 k 

1 C g i ( kr ~( k2 l 2m W /s\ 

+ (2n) m J - 7 -? m,[ r jyJh d 3 k. (4.7.12) 

The solution of (4.7.9) is uniquely determined by the condition (4.7.10), that 
is, we obtain (4.7.8), where the latter is nothing other than (4.7.3) since Q_ is 
determined if C2_e i)E ' F is known for all k. 





256 XVI Scattering Theory 


The problem of solving (4.7.9) subject to the condition (4.7.10) is often 
called the problem of stationary scattering theory. In §6 we will discover the 
practical value of the solutions i 

The fact that for many practical purposes the knowledge of the operator 
Q_ (instead of S = D+Q_) is sufficient is a direct consequence of (4.7.10). In 
the same way we obtain the following result for Q + : Q + applied to a “free” \jj 
produces a wave packet which has the same outgoing spherical wave as \J/. 

•Ad)') = is therefore a solution of the free Schrodinger equation (that 
is, of (4.7.9) where V = 0) which is identical to with respect to the out¬ 
going spherical wave. Therefore, for for large \ r\ we obtain: 


ridf) 


£ h 

(2n)^ 

kr u + 

1 

(r\ 

+ 



1 


C 2*) 3/2 

st ~ ikr 




(4.7.13) 


where htfr/r) is a yet-to-be-determined amplitude of the incoming spherical 
wave. The scattering operator S is determined by 


m v(?) = S 


' 1 

( 2 W 2 



(4.7.14) 


For the effects to be measured after the scattering it is only necessary to 
consider the outgoing spherical wave portion of the wave packet 

JZ mv(r)e~ ik2/2m]t yjk) d 3 k. 

This outgoing spherical wave portion is identical to the last term on the 
right-hand side of (4.7.12). For the application of scattering theory it is 
sufficient to know the amplitude gtff/r) of the outgoing spherical wave in 
(4.7.10), the knowledge of which is somewhat less than that of Q . (see §6). 


4.8 Scattering of a Pair of Identical Elementary Systems 

Earlier, for the sake of conceptual simplicity, we considered only the scattering 
of two different elementary systems. In fact most of the considerations of the 
previous section did not depend upon this special assumption. We need only 
to make a few minimal changes in order to apply the above methods to the 
case of two identical systems. 

Equation (1.15) from Coll. 1 is to be replaced by an operator W l 0 from 
as follows 

K = «[W"io x W‘ 20 + W‘ 2 o x W\ 0 ], (4.8.1) 

where a is a normalization factor, so that tr(W 0 ) = 1 where tr is the trace 
in{JT?} ± . 

Here it is important to choose such W\ 0 and W‘ 20 which are mutually 
orthogonal (that is W 10 W 2 o = 0), since both preparation procedures a t 
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(where <p(a t ) = W\ 0 ) and a 2 (where cp(a 2 ) = W 20 ) must be combined to a 
single preparation procedure, as described in §1. We have to assume that 
(a l ,a 2 )en c implies that (p(a ,) and <p(a 2 ) are mutually orthogonal. Many 
pseudoproblems will arise if this condition is ignored. 

Otherwise, all other considerations remain the same except for the fact 
that for the space ^ this subspace of M J rb x must be chosen for which the 
operator R x P s from VIII (3.12) takes on a (+1) eigenvalue for Bose 
systems and a (-1) eigenvalue for Fermi systems. Then the operators Q ± are 
defined in as in the previous sections. 

In §6.3 we shall provide an example of a scattering process with identical 
systems. 


4.9 Multiple-Channel Scattering Theory 

Up until now we have only considered the mathematical structure associated 
with scattering only for the case of elementary systems because in that case 
it is easy to obtain a general overview. With respect to the totality of experi¬ 
ments, the cases of scattering of two composite systems or of an elementary 
system with a composite system play a much more encompassing role. 

The complication of the scattering theory of composite systems rests upon 
the fact that there are multiple “channels”, that is, after the scattering, a 
system which is different than the incoming systems may be present. 

A system composed of N elementary systems can often be decomposed into 
noninteracting parts in several different ways if these parts are widely 
separated. The entire system consisting of N elementary systems may, for 
example, be decomposed into / parts in which the ith subsystem is such that 
it consists of n t elementary systems which are “bound” together. Here we 
would therefore obtain N = Yj=i n i- 

A system composed of n ; elementary systems is said to be bound if the 
ensemble W satisfies the condition 

W= (1 x E d )W( 1 x E d ), (4.9.1) 

where 1 x E d refers to the product decomposition VIII (1.1) and the pro¬ 
jection E d is the projection onto the subspace of the discrete spectrum of the 
Hamiltonian operator H { in VIII (1.2). 

Here we have used E d as a characterization for the definition of the “bound” 
state W according to (4.9.1). There are alternative characterizations (see, for 
example, [18]) which are, under certain assumptions about the interaction, 
equivalent to (4.9.1). 

In order to demonstrate a decomposition of a system consisting of N 
elementary systems into “channels” we shall choose N = 3 and assume that 
all three systems are of different types. We shall write the Hamiltonian 
operator in the following form 

H = H 0 j + H 02 + H 03 + F 12 + V 23 + V l3 , (4.9.2) 
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where H 0i is the kinetic energy of the ith subsystem and V ik is the interaction 
operator between the ith and icth system. 

Let E dik denote the projection onto the appropriate subspace of J^x 
into which (after separating out the motion of the center of mass of the two 
systems) the Hamiltonian operator H ik = H 0i + H 0k + V ik has a point 
spectrum. The subspace of the continuous spectrum of H in the space 
/ = can be split into “channels”, that is, into the following 

subspaces as follows: 

Channel /_: All three systems are free. 

is the subspace of all the cp for which the limit 

)/x = lim e iHa ‘e~ iH ‘(p, (4.9.3) 

t-+ — 00 

where H 0 = H 0i + H 02 + H 03 exists. 

Channel (12) _: (1, 2) are bound, 3 is free. 

j£J 12 )- is the subspace of all cp for which the limit 

lim e « H ° +v '* e -iH,(p (4.9.4) 

t~* — 00 

exists and (E dl2 x 1)^ = i//. 

The channels (13)_ and (23) _ may be defined in a completely similar way. 
Similar definitions can be made for the limits t -> oo. 

The subspaces J^ 12) _, <^ 13) _, j^ 23) _ are pairwise orthogonal. In 
each of the channels an operator QjL, 2) _, etc. is defined by (4.9.3), 
(4.9.4), etc. The domains of fij _ _, Q ( ^ 2) _ J^ 12) , etc., are not mutually 

orthogonal (by choosing suitable assumptions about the interaction, the 
domains may all be equal to Jtf). Thus, in this case, a unique operator 
Q_ Wilt as was the case in §4.2 does not exist. How do we select the “correct” 
Wq in the case of multiple channel theory? 

As in the simpler cases which we have seen in §1, this is not a mathematical 
question. It is a question of a new structure in which several preparation 
procedures are combined to form a new preparation procedure. As in §1, the 
individual preparation procedures must be such that, if the corresponding 
ensembles are extrapolated backwards in time, that there is, for all practical 
purposes, no interaction between the individually prepared systems. Thus, 
by analogy with §1, we are led to introduce the following extension of Coll. 1: 

If three preparation procedures a u a 2 ,a 3 (where a k is preparing elementary 
systems of type k with (p(a k ) = W 0k e .^(J*f k )) such that the systems are 
“shot” at each other, then we set 

W 0 = Q f (W 0l x W 02 x W oi )QJ_. 

If two preparation procedures a l2 and a 3 where a 3 is preparing elementary 
systems of type 3 with <p(a 3 ) = W 03 e&(Ji( 3 ) and a 12 is preparing bound 
states of a pair of elementary systems 1, 2 according to 

^(**12) = Woi 2 S^(^l X Jfr!) 
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such that the two systems are “shot” at each other, then we set 
Wo = ^(l, 2 )-(Woi 2 x Wo3 )Q ( ^ >2 )_ • 

The asymptotic measurement of effects after scattering can already be defined 
with the aid of W 0 if we know the asymptotic behavior of W 0 . In general this 
problem, of course, cannot be simply analyzed, as, for example, in §4.7. Later 
we shall only give an example (see §5.3) because, we cannot give an overview 
of the most important applications (for multi-channel theory see, for example, 

[19])- 

5 Examples of Wave Operators and Scattering Operators 

We shall now illustrate the general theory presented in §§1-4 by the use of a 
few examples. 


5.1 Scattering of an Elementary System of Spin j by an 
Elementary System of Spin 0 


In §§3 and 4 we have not taken into account an important symmetry—the 
rotational symmetry of the interaction V. In Jf?; = Jf rb x t l/2 we therefore 
find it necessary to make use of the reduction of the rotation group which was 
carried out in XI, §10. In XI (10.21) and XI (10.22) we obtained vectors which 
belong to an irreducible representation of the rotation group. These two 
vectors differ in their symmetry under reflection r -> — r. The scattering 
operator S must therefore commute with the operators of the representation 
@tj and the reflection operator. 

Since S also commutes with H, we use the (improper) eigenvectors of H. 
Let g kl (r ) denote the solution of the equation (see XI (3.4)): 


J_d_ / 2 dg k \ 
r 2 dr \ dr ) 


1(1 + 1 ) 

' 2 


9u = k 2 g kl . 


Then the functions <p[ J M given by 


(5.1.1) 


and 


= g kl (r)[ . I 1 t, m 1 Y' m u + + ILa Yi + 


<PfcmV 1/2 2 — 9kl+ l( r )( — 


21 + 1 


ll — m + 1 
/“ 2 TT3^ _ 


21+1 


Yl +l u + + 


+ 2 Y l+ 1 u \ 

I 21 + 3 m+1 -j 


are improper eigenvectors of S, where the eigenvalue does not depend upon 
M ; we obtain 

ScpVu = e i2 ^ k V k j M , (5.1.2) 
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where the quantity is called the phase shift, where the latter is, of course, 
only determined up to integer multiples of n. The factor 2 in the exponent is 
chosen by convention. 

The g k ,(r) may be directly expressed in terms of Bessel functions. Later we 
shall see that they may also be obtained by using other methods than solving 
the differential equation (5.1.1) directly. 

If the interaction V (approximately) does not depend upon spin, then 
Sij{k) will be independent of j. In that case, we write <5 i; (/c) = <5 ( (fc). 

In order to actually compute S we must obtain a method of computing the 
d,j(k). We will now carry out this computation for the case in which the 
interaction V arises from a potential function V(r). 

In order to determine Q . we now seek a solution i //jff) of equation (4.7.9) 
which satisfies (4.7.10). First we seek a solution of (4.7.9) of the form ri kl (r)Y l m . 
Then it follows that (see the derivation of XI (3.4)) 

llr) + ^ t 2 ^ T,kl + 2mV ^ rikl = k2y,kl ' ( 5 ' L3 ) 

Setting u kl = rri kl it follows that 

« 

- u h + ^ 1 + 2 ^ u kl + 2 mV(r)u kl = k 2 u kl . (5.1.4) 

r 


Thus it follows that if V(r ) decreases more rapidly than 1 /r as r 
will behave asymptotically like 

u k i(r) ~ c sin(kr — ^ n + $,(k) 


oo then u kl 
(5.1.5) 


Later we shall show that £,(k) has been defined in such a way that 5, = 0 
for V = 0. 

The B,(k) will be determined by (5.1.5) up to integer multiples of n because 
only solutions u kl of (5.1.4) may be used which are regular for r = 0 (see also 
XI, §3). 

The desired solution i //rfr) must therefore have the form 

v-' u k i(y) i . „ . 

L a lm - Y‘ m (d, Ip), 

im r 

where for the sake of simplicity the polar axis for 8, <p is chosen to be in the 
direction of k. For large r, according to (4.7.10) we must have the asymptotic 
behavior 


V a Ukl Y l 

la a lm 1 m 


ikr 


l, m 


(2tt) 3/2 


e ikf + 




In order to determine the a im , we need to expand the plane wave in terms of 
spherical harmonics 


e* r = e ik '™ e = f g kl (r)Y l o (0). (5.1.7) 

1 = 0 
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From (5.1.7) it directly follows that g k ,(r ) must be the solutions of (5.1.1). We 
may therefore obtain these solutions in the following form 


gjr) = 2n J e ikri Y l 0 (0 d£, 

where £ = cos 6. For Y l 0 given by VII (3.55) it follows that 

0 ki(r) = 


21+1 1 
' 2 2 ' 


w: 


,ikrS 


d l 


d? 


(1 - Z 2 Ydl 


(5.1.8) 


(5.1.9) 


From which we may obtain g u by repeated integration by parts. Here we 
shall only carry out the first integration by parts; we obtain 


9ki(r) = 


'21 + 1 1 1 


2 27! 


r 

! ikr J _! 




where 0(l/r 2 ) means that additional terms decrease at least as rapidly as 
1 /r 2 as r -> oo. We therefore obtain the following asymptotic behavior: 

9ki(r) ~ 2 yfn V2/T7 (- i)‘ ~ singer - ^ nj. 

We may choose the yet undefined constants in tj kl such that rj kl has, according 
to (5.1.5), the asymptotic behavior 

dki(r) ~ ~ singer - l - n + 3,(k)j. 

In this way the constants a lm in (5.1.6) are uniquely determined. It follows 
that 


Q 


■((2 


Mr) 


2 Jn 


= j? 0 fc #+i(-o^ ,,( von(0). (5.i. 


10 ) 


Thus f k (6) is also determined by (5.1.6); we obtain 

f k (0) = (5.1.11) 

i-o K 

We shall now determine S. Since must be a solution of (4.7.9) we may 

now write 

Mr) = fj>i9ki(r)Y l o(6), 
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where b, are undetermined coefficients. Asymtotically it therefore follows 
that 


nX'I'itf) ~ ^ 3/2 Z o b,Y l 0 (6)2 y /n Jll + l(-i)'^; s in^r - yj. 

The outgoing spherical waves of H +1 p^r) and 1 j/rfr) must, however be identical 
from which it follows that 

b t = e 2iSl(k \ (5.1.12) 

Therefore we obtain 

S ((2^ e ' k r ) = (27tP ,f 0 e2 ‘ a,( '‘ ) ^fW y o( 0 ) (5U3) 

from which it follows that 

SfeMnW = e 2i *'< k >g kl (r)Y l 0 (d), (5.1.14) 

that is, $,(/c) is identical to the quantity <5,//c) = <5,(/c) defined in (5.12). 

Therefore the scattering operator S can be obtained by solving the equation 
(5.1.4) and by computing the phase shifts determined by equation (5.1.5). 


5.2 The Born Approximation 


The approximate solution of (4.7.8) obtained by an iteration scheme is known 
as the Born approximation. Here we shall only consider the first step for the 
case in which V(r) is a potential. The first iteration step consists of replacing 
i//i£r) on the right-hand side of (4.7.8) by (l/(2n) 3l2 )e ikf . It then follows that 


U0 = 


m 


(2n) 312 (2ri) 


5/2 






V(r')e ik f ' d 3 r. 


For e = F/|F|, for large |F| we obtain the asymptotic behavior: 


e ik\r-r'\ ~ e ik\f\-ikf'-e 

In this we obtain the f k (9 ) defined in (5.1.6): 

m = - ~ J V(r')e ikf -^-W, 

where e 0 — k/\k\. Since 6 is the angle between k and Fit follows that 


m 


= — 2m * —— f r’V(r') sin(kr'je 0 — e|) dr’ 

k\e 0 — e\ Jo 


m 


k sin( 0 / 2 ) 


/» 00 

r'V{r') 

•JO 


sin( 2kr sin -1 dr. 


(5.2.1) 
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We shall now consider the example of Rutherford scattering; in particular, 
we shall consider the scattering of an electron by a nucleus of charge Z, 
that is, V(r) = — Ze 2 lr. In order to make it possible to apply scattering theory, 
and in order to obtain better descriptions of real experiments (see §4.5), we 
shall replace V(r ) by V(r) e~™ and let a -* 0. Then we obtain 


m = 


mZe 2 

2k} sin 2 (0/2) 


(5.2.2) 


which should, except for small d, be a good approximation. For 6 close to 
zero it is, however, extremely difficult to distinguish the scattered systems 
from nonscattered systems (see §6). 


5.3 Scattering of an Electron by a Hydrogen Atom 

Here we will now give an example of a multiple-channel scattering experi¬ 
ment which will not only illustrate the general discussions of §4.9 but will 
also serve as a demonstration for the discussion to follow in XVII, §§6.2 
and 6.4. 

The problem of the scattering of an electron by a hydrogen atom will be 
simplified in that we shall consider the nucleus as a fixed stationary field of 
force. The fact that this is permissible in the case of scattering experiments 
because of the large mass of the nucleus (this is also true for the case of light 
emission; see XI to XIV) will be justified in more detail in §6.2. 

The Hamiltonian operator has the form XII (2.1) where Z = 1. We will 
neglect the effect of spin. In stationary scattering theory (that is, in the 
determination of Q_) we shall seek a solution <f> of 

H<t>(h, r 2 ) = (^ + E^ru r 2 ) (5-3.1) 

of the form 

4>±(r i, r 2 ) = y ik fl (p 1 (r 2 ) ± e ik r2 (p i (r l ) + Xk(fi, r 2 ) 1 (5.3.2) 

where describes the outgoing spherical wave. Here (p,(r) is the eigen¬ 
function for the ground state of the hydrogen atom where the energy 
eigenvalue is E v 

We shall now determine the form of W‘ 0 according to (4.8.1). To this 
purpose we shall make assertions about W[ 0 and W‘ 20 , that is, assertions 
about how the experiment is to be carried out. 

We shall assume that the colliding electrons and the hydrogen atoms are 
“unpolarized”, that is, the following condition with respect to the product 
representation of the Hilbert space of an electron holds: 

w \o = Wiot X 1, W l 20 = iP^H) X 1 
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Since we may write 

W\ 0b = Z WvP*, 

V 

where 


Uh) = je ik fi xM d 3 k 

it suffices to only consider the case W\ ob = P^. According to (4.8.1) we 
obtain 

W' 0 = «[(P, x PJ x (1 x 1) + (P qi x P+) x (1 x 1)] (5.3.3) 

with respect to the product representation {M’l x t 2 ) using the nor¬ 
malization factor a. As we explained in §4.8, we may assume that i// is ortho¬ 
gonal to q> v Then, according to (5.3.3) W ! 0 as an operator in 

Wl x t 2 }„ = + x {t 2 }„ © {^l)_ x {i 2 } + , 

will have the form 

W 0 = i[/V + x P 0 + P^_ x Pj], (5.3.4) 

where 


<A+(r 1; r 2 ) = - 7 =W(r 1 )<p 1 (r 2 ) + <Pi(FiW 2 )X 
V2 


<A-(^i, r 2 ) = —7=(^(ri)<j9i(r 2 ) - <PiO'i)iA(r2» 
V2 


where P 0 is an operator in and is the projection onto the subspace 
with total spin 0 and P t is the projection onto {4 2 } + with total spin 1. 

From the form (5.3.4) we suggest that it is reasonable to seek solutions of 
the form (5.3.2) because it will then be possible to determine the operator 

W= 

In order to obtain solutions of (5.3.1) of the form (5.3.2) it suffices to first find 
solutions 4> of (5.3.1) of the form 

#T, r 2 ) = ^>i(r 2 ) + ’' • (53.5) 

[here + • ■ • refers to outgoing spherical wave terms] because (f)±(r u r 2 ) = 
cf)(f 1 ,r 2 ) ± <j>(r 2 , Fj are solutions of (5.3.1) which satisfy the conditions 
(5.3.2). 

In order to find solutions of (5.3.1) of the form (5.3.5), we decompose H in 
the following way: 

Ze 2 e 2 

H — H 01 + H A2 -+ — (5.3.6) 

r i r 12 
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where 


#01 = jPl 


H 


A2 


2m 2 r 2 


(5.3.7) 


H a is the Hamiltonian operator of the hydrogen atom. 

In terms of thejiotation used in XI, §§4 and 5, we may write 0(r l5 r 2 ) in 
the ( H a , L 2 , L 3 , 2i) representation as follows: 

h) -> <e, l, m;r l \<j) >. (5.3.8) 


Here s runs through the discrete and continuous spectrum of H A . In this 
representation (5.3.1) becomes 


1 _ z ^_ 
2ml 1 r, 


<£,/, m;ri|^)> + e : 


E f Ul,m 
l', m' Jo \ 


12 


s\ /', m' 


<e', /', m';r l |</)> de' + £ — E m rri 


l ,m' ,n 


12 


<£„, m'; ^ |<)5> = I — + E 


{^m + El ~ s ) < fi ’ 1 ’ m; 


^il<£>- 


(5.3.9) 


In this way <6, /, m|l/r 12 |e', m'> is a function of r,, namely the matrix 

which represents | r l — r 2 1“ 1 as an operator in :# hl which depends upon the 
parameter f y . 

Of course, it is not possible to obtain an exact solution of (5.3.9). Thus we 
shall consider two approximations. 

For the case of low excitation probabilities (that is, for sufficiently slow 
electrons) we consider the equation (5.3.9) for e = E u where we neglect all 
nondiagonal terms of <£ t , 0, 0| l/r 12 |e', /' m'>. We therefore obtain 

- 2~ A i + ^i)] < £ i> 0> 0; ^ < £ i> 0, 0; r t |0>, (5.3.10) 


where 


F(ri) = - — + e\ E u 0, 0 

fi 


,12 


E i, 0, 0 


(5.3.11) 


F(rj) is, according to (5.3.11), the Coulomb potential for a nucleus shielded 
by the charge density —e\(p^r)] 2 . Condition (5.3.5) may then be expressed 
as follows 


<£„ 0, 0; rJ0> = e Ck f ' + • • •, (5.3.12) 

where + • • • refers to outgoing spherical wave terms. Here (5.3.10) together 
with (5.3.11) represent nothing other than a scattering problem with potential 
function V(r). For slow electrons the atom behaves like an elementary system 
with the interaction potential V(r ) between the atom and the colliding electron 
given by (5.3.11). This example illustrates the importance of scattering 


Piifie. TfLa^JtASjp UM^i c jaJ. ?A.y Tt c J. 



266 XVI Scattering Theory 


problems (other than the Rutherford scattering in §5.2) which are described 
by a potential function. 

The quantity f k (9) is determined by the solution of (5.3.10) and the 
condition (5.3.12) [its asymptotic behavior] as follows: 

1 1 e ikr 

<Eu 0, 0; r | <£> ~ e ikr + f k (6) — (5.3.13) 

a result which is of great importance for the computation of the interaction 
cross section in §6.3. 

We shall now assume that we have solved (5.3.10). Then, in the next 
approximation step of equation (5.3.9) for e = £„ where n # 1 we only 
retain the sum of two terms: 




+ e 2 £ \E„, l,m 

V.m' 


' 12 


<£„, l, m; 

E„,r,m'KE n ,r,m';rM > (5.3.14) 


= -e 2 { E n , l, m 


1 


0,0 )<£ 1; 0 , 0 ; 


Condition (5.3.5) takes on the following form for <£„, /, m; r\<p} (n # 1) 

<£„, l, m; r, |= outgoing spherical waves only. (5.3.15) 

This condition may be satisfied because (6.3.14) represents an inhomo¬ 
geneous equation for <£„, /, m; r, |</>>. For its solution we need only choose 
the Green’s function for the operator on the left side of (6.3.14) which satisfies 
the “radiation condition”, that is, for large distances, only contains outgoing 
spherical waves. In order that such a Green’s function exist, it is necessary 
that 


that is, the kinetic energy of the colliding electrons must be greater than the 
excitation energy E„ - £, . If, for example, k 2 /2m < £ 2 — £ 1; then there is no 
excitation, that is, in the sense of the collision process, for t -» oo only a 
scattered electron and an atom in the ground state can remain. For elastic 
scattering the solution of (5.3.10) with (5.3.11) is only an approximation, 
albeit, a very good one. 

In the case in which k 2 /2m < E 2 — E l there are, according to (5.3.9) 
additional possibilities for an elastic collision, namely a term in (5.3.5) 
which, for large r 2 , takes on the form 

e ikn 

•••PiOr)—, 

r 2 

that is, the case in which electron # 1 remains in the ground state and electron 
#2 is distant from the nucleus. Here (5.3.9) is ill-suited for an approximate 
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computation of this effect because it would be necessary to work with a 
complicated eigenfunction (r 2 1 e, /, m) of H A2 . Instead, we return to (5.3.1) 
and make the following approximation assumption: 


h) = X(ri)(pi(r 2 ) + (?i(oMr 2 ), 


(5.3.16) 


where y(r,) denotes the solution of (5.3.10) subject to the condition (5.3.12). 
From (5.3.1) it follows that with 


m- 

J \r - r j 


(5.3.17) 


we obtain the following equation for t](r 2 ): 




- 


= e 2 (p(r 2 ) |^F(r,) - J <?>i(^i)x(^i) d 3 r t . (5.3.18) 

This equation may be solved in the same way as (5.3.14) with the help of 
the Green’s function which satisfies the radiation condition. 

The asymptotic form of the solution of (5.3.18) is given by 


n(h) 


ikr2 




(5.3.19) 


and determines the important amplitude function h k (9). 

We may also seek to obtain another approximation for the solutions of 
(5.3.10) and (5.3.18) by using the Born approximation, as in §5.2. For x(r t ) 
from (5.3.10) and (5.3.13) it follows that 

MO) = - J V(?)e ikr < t°- S ' d 3 ?, (5.3.20) 

where V(r) is given by (5.3.11). In order to compute tj from (5.3.18) we 
simplify (5.3.18) as follows: 


1 A / 

-A,- 

2m 2m 


d(r 2 ) = 


( 2 «) 


3/2 


g(r 2 ), 


(5.3.21) 


where 


0(r 2 ) = (Pi(r 2 ) F(r t ) 


/(' 


IG - r 2 


(ptfje**' d 3 r v (5.3.22) 


Thus it follows that 


n(r 2 ) = 


me 2 

(2 W 11 


/ 


£ ik\r 2 - f’| 


g(r) d 3 ir’ 
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from the asymptotic form (5.3.19) we obtain 

me 2 r 

K(0) = - j e~ ikr e g{r") d 3 r', 


(5.3.23) 


where e is the direction of r 2 ■ 

In the same approximation we may solve (5.3.13). For k 2 — k 2 + 
2 m(E l — E „) it follows that 


<£„, l, m; 


me 2 

(2^F 


Ji 


yik n l?l — f' I 


r, - r 





■e i!it 'd 3 ? 


and, in asymptotic form, we obtain 


me 2 




Lmeo-Ktyr / R , 

1 

1 -V —‘I 

J \ 

\r ~ r 2 | 


(5.3.24) 
E u 0, 0 ) d 3 ?. 


We will not attempt to evaluate the integrals in (5.3.20), (5.3.23) and (5.3.24). 
The derived expressions were obtained more to demonstrate the methods 
than to show (in approximation) the exact behavior of the individual 
scattering channels (after the scattering). 

The system of equations (5.3.9) or the equation (5.3.1) also permit the 
possibility of describing the ionization of the hydrogen atom by means of 
collisions with electrons. For approximation methods for such computations 
we refer readers to the literature [20]. 


6 Examples of Registrations in Scattering Experiments 

Now that we have considered (in §§4 and 5) the mathematical questions 
concerning the existence and the computational methods for the Wave 
Operators and Scattering Operator, we now turn to physical questions 
concerning the registration of systems after the scattering has occurred. In 
§2 we have only sought to obtain a mathematical formulation of what we 
mean by the expression “after” the scattering process. In order to obtain 
experimentally verifiable results we must know which effects are to be 
measured in the individual experiments. Since we have not yet developed a 
theory which permits us, on the basis of the construction of the registration 
apparatus, to draw conclusions about the nature of the corresponding effects, 
it remains for us to discover such F with the aid of theoretical considerations 
and experimental experience, and then to use the above information in order 
to establish axioms together with the corresponding mapping principles. Our 
procedures will be completely analogous to the procedures described and 
used in XI, §1. We will also show that the registration effects for scattering 
experiments are not, in general, decision effects. 
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6.1 The Effects of the “Impact” of a Microsystem 
on a Surface 


The apparatuses by which microsystems are registered in scattering experi¬ 
ments have, in general, a wide variety of forms. Experimental physicists are 
always inventing new registration methods. Here it is usually necessary that 
the operator F = il/(b 0 , b) which corresponds to a given effect procedure 
(, b 0 , b) must be determined from a mixture of theoretical and experimental 
considerations. There should be no doubt that working experimental 
physicists will take great care in this direction even though they may not 
express that care by using such an abstract expression as “seeking the operator 
4>(b 0 , by'. In their considerations, the intuitive desire for a special operator 
F = \l/(b 0 , b) plays an important role: They seek to construct a “surface” 
!F such that “every microsystem reaching the surface” is registered. What is 
the meaning of these “intuitive” words? 

We return to the special case of an elementary microsystem. In the 
Schrodinger picture, and in the position representation an ensemble of the 
form P >j , t can be described by a which satisfies the Schrodinger equation 

-7i <iW -> = _ 2S 4<?l *'>' (6 ' L1) 

The position probability density is given by 

w(r, t) = |<r|*A t >| 2 . (6.1.2) 

A direct consequence of (6.1.1) is the equation of continuity 

— w(r, t ) + div s(r, t) = 0, (6.1.3) 

ot 


where 


s(r, t) = 


1 

2 im 


[<r|^> grad<r|i/r ( > 


- <r |i/6> grad<r|^ ( >]. 


(6.1.4) 


Here s(r, t ) is called the probability current density because of its role in 
equation (6.1.3). Such terminology does not, however, specify if and how the 
named quantities may be measured. From Gauss’ theorem it follows that 
by integrating (6.1.3) over a volume Y bounded by the surface Y we obtain 


1 

dt 




(6.1.5) 


where n is the (outwardly pointed) unit normal vector to the surface element. 



( 6 . 1 . 6 ) 
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is the probability of finding a microsystem in the volume Y for a measure¬ 
ment of the “position at time t”. This probability may depend upon time. 
Since the position at different times are not commensurable, the time 
dependence of the probability (6.1.6) cannot be determined by measuring 
the position at various times on the same systems; instead, it must be deter¬ 
mined in the following way: We first measure the “position at time t” on 
systems prepared by a procedure a for which (p(a) = . Then, in another 

experiment we measure the “position at time t 2 ” on other microsystems which 
are prepared by the same procedure a. The time dependence of (6.1.6) (which 
is measurable only in the above way) satisfies (6.1.5). This fact suggests the 
following intuitive interpretation of s(r, t):s(r, t ) represents something like 
the net flow of particles through the surface area. With this intuitive inter¬ 
pretation, it is easy to be deceived about an actual physical problem: Here it 
is often believed that it has been proven that 

s(r,t)-hAfAt (6.1.7) 

describes the “probability” that a microsystem has passed through the surface 
element Af in the time interval At. Underlying this description there are 
physical problems because we do not know how to measure a quantity 
like (6.1.7). We will now show that (6.1.7) has the form tr(fU4) where W = P#. 
Instead of (6.1.7) we shall use the mathematically more precise expression 

f dt f s(r,t)-fidf, (6.1.8) 

Jll 

where 3F may be an arbitrary nonclosed surface. Using the formula 

J(6.1.9) 

for the transition from the position to the momentum representation, it 
follows that the expression (6.1.8) is equal to 

JF^ jhf f « + p) " df ' 

• <k'|i^ 0 > d 3 k d 3 k'. 

We now define a self-adjoint operator A as follows: 

A(k\4 >> = j<k\A\k'y(k'\4/} d 3 k’, (6.1.10) 


where 

<k\A\k'y 


1 r e (i/2m)(k 2 -k’2)t dt f e Uk'-k)-r k’)-ndf r 

In) 3 J tI Jj? 2m 


(2n) 3 

then (6.1.8) takes on the form 

tr (WA\ where W = P lj/0 . 


( 6 . 1 . 11 ) 
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Here (6.1.11) has the form of the Heisenberg picture because P llro is time 
independent, while the operator A, under displacements of t u t 2 by y changes, 
corresponding to the Heisenberg picture. For A given by (6.1.10) tr (WA) is 
determined for arbitrary W. What is, however, the physical interpretation of 
A in the formula tr(FF^4)? Is A an observable or an effect ? Since A is obviously 
not a positive definite operator, A cannot be an effect. 

If we interpret A as a decision observable, then from (6.1.10) it follows that 
A 2 is not a projection operator, that is, A has eigenvalues (and therefore 
measurement values) other than 1, 0 and — 1. This is hardly what we would 
expect “intuitively” by a probability current. From the expression (6.1.11) 
alone we cannot make any conclusions concerning the physical interpreta¬ 
tion of the operator A ! Failure to take heed of this general result often leads 
to errors. 

The quantity A obtained from (6.1.4) should, however, have something to 
do with the passage of a microsystem through a surface -F . Since, however, 
there are no objective properties for microsystems described by quantum 
mechanics (see IV, §8), it therefore makes no sense to say whether or not a 
microsystem, as such, passes through the surface F in the time interval t, to 
t 2 . A “registerable” effect can only be produced or not produced at a registra¬ 
tion apparatus. We must therefore have a registration apparatus which is 
capable (at least in principle) to register this “passage”. 

As our first attempt it seems that it would be reasonable to indirectly 
define an observable (by analogy with the indirect definition of the position 
observable) which registers the times for which a system passes from one side 
to the other of the surface F. Here it is reasonable to construct the following 
Boolean ring E: 

We begin by selecting intervals on the time scale. To each such interval./ 
we assign a pair of indices (+) and (-): F + and J -. We define E to be the 
Boolean ring generated by these intervals J+ and , where for each pair 
of intervals J+ andwe require that./ + a = 0. 

We now seek to obtain observables E -» Lsuch that F( J + ), F( J _ ) have 
the following intuitive meaning: the microsystem passes through the surface 
■ f in the time interval t, to f 2 from left to right and from right to left, respec¬ 
tively. In order to justify this intuitive description we shall, among other 
things, assert that F(J\) = e iH 'F(J^)e H/ whereis obtained from J by 
a time displacement y. This requirement cannot be satisfied by a decision 
observable! For nondecision observables we have no intuitive basis by which 
we may uniquely define the map F. On the contrary, we expect that real 
registration procedures should be described by effects which are in- 
separable(!) from the actual details of the construction of the registration 
apparatus, even in the case in which they appear as intuitively natural as a 
registration whether a microsystem “passes” or “strikes” the surface F or not. 

Nevertheless, experimental physicists look for effects after scattering 
which, for all practical purposes, do not depend upon the particular con¬ 
struction of the registration procedures; they measure, for example, the 
so-called “interaction cross sections” which depend only on the interaction 
between the colliding microsystems. How is this possible? 
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This possibility essentially depends upon two facts: First, experimental 
physicists use only “collection surfaces” which are large compared to |/c|~ 1 
where the k are the various values of the momentum where (k\i/j 0 y is, for all 
practical purposes, nonzero. Second, in scattering experiments experimental 
physicists do not use ensembles for which differences between various 
registration methods are readily detectable. These two different viewpoints 
are, of course, not mutually independent because the form of the surface F 
and the ensemble have to be adapted to each other. 

Such situations are not uncommon in physics, but are disturbing to 
mathematicians because they are not aesthetically pleasing. Physics does not 
always present aesthetically pleasing proofs. In fact, the situation described 
above could be considered to be a type of asymptotic approximate descrip¬ 
tion, which we actually have already introduced from the beginning of our 
description of the scattering process. The operator A in (6.1.10) has already 
been introduced in order to obtain an asymptotic approximate description 
of the effect of a system striking a surface area F. 

We now consider a plane E where F will refer to different portions of this 
plane. We shall denote the operator defined in (6.1.10) by A (J, F) where J is 
the time interval between t l and t 2 . 

A (./, F) defines an additive measure on the Boolean ring Z tj£ generated 
by the intervals J and the surfaces F. a result which follows directly from 
(6.1.10). This measure E -> F{:F) is, however, not positive. Here it may be 
suggested splitting A into positive and negative parts. For scalar measures 
such a decomposition is upique (see IV, §2.1). We will not discuss this question 
for measures for which the domain is a subset of a vector space F(F), 
because it is first necessary to examine the physical situation more closely. 

The actual situation of registration is not identical with any intuitive 
description of the passage of microsystems through a surface F. On the 
contrary, experimentally we seek to construct “detection surfaces” in which 
the systems can only “strike” the surface from one side in order to produce the 
registration effect (see Figure 39). This diagram is to be understood only in 
a very symbolic way. The technical details of the construction are not 
specified in Figure 39, but apart from all details the construction shall be 
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made in such a way that a “sensitive” detecting surface 3F is defined. The 
microsystems interact with the material of the detecting surface in such a way 
that a macroscopic effect is triggered at time t. 

From the theory of diffraction (in particular, diffraction of light) it is well 
known that it is possible to describe the radiation falling on a surface !F to 
very good approximation as if the radiation from behind (right in Figure 39) 
has been shielded, and the radiation from the front is unhindered. The 
diffraction phenomena at the apparatus can therefore be neglected, providing 
that we are only interested in the radiation falling upon the surface !F. We 
shall now seek to formulate this approximation more precisely. 

For simplicity we shall choose the coordinate system such that the plane 
E is spanned by the (2) and (3) axis, and the (1) axis points towards the 
interior of the apparatus (see Figure 39). Here it is reasonable to characterize 
the incoming radiation on the surface !F by requiring that the operator A in 
(6.1.10) be replaced by the null operator whenever k-novk'-n are negative. 
We now define an operator F (using the coordinate system introduced above) 
as follows: 

F(J, JF')<k|(A> = J<k|F|/c'></c'|i/f> d 3 k\ 


where 


<k\F\k?> 


_J f Ji/2m)(P-k' 2 ) f 0 il(k' 2 -k 2 )x 2 + (k' 3 -k 3 )x 3 ] 

C2rr) 3 J tl ] e 


• — (&! + k\)dx 2 dx 3 for/q > 0, k\ > 0, (6.1.12) 
2m 

otherwise. 


F{J, #") defines, like A(-f, .W), an additive measure on the Boolean ring 
Z (>£ . The conjecture that £ I>£ 4 L(Jf) is, unfortunately false, although in 
(6.1.12) k 1 >0 and k 2 > 0, because the operator defined by (6.1.12) is not 
positive definite. Therefore (6.1.12) does not represent an exact observable, 
that is, it cannot precisely define the production of effects. 

Equation (6.1.12) should, however, approximate the measure tM(b 0 ) -> L( Jf) 
for a registration method b 0 . On the basis of the actual characteristics of an 
apparatus, an experimental physicist knows that the occurrence of a detection 
signal cannot be localized to arbitrarily small time intervals ( t 2 — t,). In 
addition, the surface J* may not be too small in order that diffraction should 
not play a role, as we have described earlier. In addition, a real apparatus 
cannot respond to arbitrarily small k t . The approximation of 3t{b 0 ) -> L(Jf) 
by F(J, J 5 ") is therefore meaningful only if At = t 2 — t l is not too small, .W 
is not too small and the ensemble does not contain too small components k, 
of momentum. 

If At and J is not too small, then 

f e (H2m)(k 2 -k’ 2 ) dt f e W 2 -k 2 ) X2 + (k’ 3 -k 3 )x 3 ] dx 2 dx 3 

Jti J dp 
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decreases so rapidly in Iq — k \, that in the slowly variable factor (k , + k \), for 
all practical purposes /c, may be set equal to k\. Therefore it is reasonable to 
choose the following operator ¥{ J, 3F) 


JUk '2 - k 2 )X2 +(k 3 - k 3 )X3 


(k\F\k'y = 


f -1- f Jil2m)(k 2 -k’ 2 )t f p i[(k’ 2 

I (2tt) 3 J tl ), 

• — ■ s fk l ■ s /k\ dx 2 dx 3 for k l > 0 and k\ > 0, (6.1.13) 


otherwise 


instead of F(.J, S'). 

Here we shall not attempt to estimate the “error” 

I jj<.'l'\k)«k\F\k’) - (k | F |/c'»</c'11^> d 3 k d 3 k' 


and we will not show that this “error” decreases with increasing At and 
increasing S'. This “error” has no physical meaning since we already know 
that (6.1.12) is only an approximation to an actual registration procedure. 
Therefore it is meaningless to consider equation (6.1.12) to be the “correct” 
formula and (6.1.13) to be the approximation. We may only hope that real 
experiments (where At = t 2 — t x and S' cannot be chosen arbitrarily small) 
can be described “in good approximation” by (6.1.13); by making test 
experiments, we may determine whether our hopes are justified. An experi¬ 
mental error can only be defined as the difference between (6.1.13) and the 
effect of the actual registration procedure. 

We will now show that (6.1.13) actually determines an additive measure 
£,,£ -4 L(Sf). The additivity of the measure follows directly from (6.1.13). 
From 

JJ<^|/c></c|F|k')<fc'|^> d 3 kd 3 k' 

= —-Tj — (* 2 dt [ dx x dx 2 | f e (il 2 m)k 2 t e ~ i<k2X2 +k ^^(k\ijj)d 3 k 
(2tc) m J tl | J kl> o 

it follows that F(J, S') is a positive operator. If we show that for t l -> — oo 
and t 2 -> oo, and for#" = £ (the entire plane) then F < 1 we will have proven 
that Yji.e 4 L(SF). For q -> — oo and t 2 -> oo, and = E, from (6.1.13) it 
follows that 

<k\F\k'} - X -Jk,sjK, s(J- ( k 2 - k ,2 )jd(k' 2 - k 2 )8(k 3 - k 3 ) 


and we therefore obtain 


for k x > 0 and fcj > 0 


I<fc|«A>I 2 d 3 k < 1. 

Jki > 0 
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For real scattering experiments only such ensembles are used (because they 
are simpler to produce) for which W = E v w v P ll , v where the individual i f/ v 
have small variance in momentum (W itself can have a larger variance!). If 
k <v> is the average momentum for the ensemble P lj/v , then we approximately 
obtain 

tr(P*F)= ff<^ v |fe><fc|F|/c')<fc'|<A v > d 3 kd 3 k' 

JJ (6.1.14) 

^.(v) Ml /• I r |2 

» 3 dt \ dx l dx 2 \ \ e ilk2X2 - k3X3 -< k2l2m)t \k\il,\d 3 k 

m(2n) J tl Jjjr I J kl >o 

The result (6.1.14) is very intuitive, because from 

<r|«A v( > = ^2 Je® f - (fc2/2m),1 <fc|<Av> d 3 k 

(6.1.14) becomes 


- k^ r t2 r 

tr(P^, V F)«— dt \ |<0, x 2 ,x 3 

m J tl 


|i^ vr >| 2 dx l dx 2 


(6.1.15) 


where 1 is the mean velocity in the direction of the normal to the 
surface and | <0, x t , x 2 \ | is the position probability function at time t on 

the position (x 2 , x 3 ) on the plane E. 

We shall now provide a summary of our considerations: Experimental 
evidence shows that it is possible to construct registration methods b 0 with 
a surface .F in the sense of Figure 39 in such a way that for the registration 
procedure b which produces a signal in the time interval J the effect >l/(b 0 , b) 
is represented to a high degree of accuracy by F( J, .F) providing that no 
extremely small values of /c, are present in the ensemble. We call F(.f, #") 
given by (6.1.13) the “effect of the impact of a microsystem on the surface & 
in the time interval J". 

The intuitiveness of the expression (6.1.15) is the basic reason why it is 
usually not necessary to examine the real difficulties associated with the 
registration process (and thereby avoid making serious errors), because the 
ensembles used in scattering experiments are already so constituted, and the 
surfaces F so selected that the conditions for the application of (6.1.15) are 
satisfied. 

Since we are concerned in this book with a realistic and sober conceptual 
analysis of quantum mechanics, including the experiments used for measure¬ 
ment, we have found it necessary to examine the conditions for the applica¬ 
tion of (6.1.13) and (6.1.15) more closely. 


6.2 Counting Microsystems Scattered into a Solid Angle 

Now that we have discussed the fundamentals about the effects F(.f, .F) 
which describe the impact of microsystems on a surface F in the time interval 
«/, we can now proceed to discuss additional specialization and applications. 
We shall not carry out all computations and estimates explicitly. 
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Scattering experiments are carried out in such a way that the surfaces 
A#" are placed at great distances from the scattering region, and the number 
of those systems which strike the surface AtF after scattering in a given time 
interval J are measured. The experimental situation described in the previous 
sentence will now be expressed in mathematical form, that is, in terms of 
assertions about the structure of the prepared ensembles and registered 
effects. These assertions will not be laws of nature in a new axiomatic formula¬ 
tion associated with the theory. Instead they will be brief formulations of the 
structure of the experiments which can be verified before carrying out the 
“proper scattering experiments”. Here experimental physicists will either 
speak of test experiments, calibration of the apparatus or of verification of the 
technical specifications of the apparatus. These yet-to-be-formulated 
structure assertions are to be deduced in the theory from those relations 
which are nothing other than the mathematical formulation of the results of 
the test experiments. 

In the momentum representation we require that (ki\W\ 0 \k l y and 
frequently introduced concerning the experiments without much discussion 
in such words as the following: “We consider experiments in which... is true”. 
Here we note such a statement is a substitute for the more precise statement- 
additional structures are introduced based upon experimental facts which 
have been experimentally verified. Such situations are distasteful to mathe¬ 
maticians because, by admitting new formal axioms based upon experi¬ 
mental results, the mathematical part of a physical theory will never be 
“axiomatically closed”. Readers who are interested in a more precise con¬ 
ceptual formulation of this situation are referred to [8] and especially to the 
notion of a “real hypothesis” in [8], §10.4. 

In this sense we now “consider experiments for which the following is 
true”: Before the scattering the ensembles satisfy the requirements made in 
§1 where W\ 0 and IT 20 are such that the position probability densities 
<ri|W" 10 |ri> and (f 2 \W 2 0 \r 2 y are nonzero only in a macroscopic neighbor¬ 
hood of the origin of the laboratory reference system. Here we refer to this 
region as the collision region for the experiment. 

In the momentum representation we require that <^il W"iol^i> and 
(k 2 \W i 2 o\k 2 } is nonzero only in a (as small as possible) region about 

(resp. k 2 ). Here the experimental physicist will seek the smallest possible 
dispersion in the momenta for scattering experiments. 

Later we shall impose additional requirements on W‘ 10 and W\ 0 ; at present 
these requirements will not be well understood: 

The position of the collection surface A#' is adjusted relative to the 
collision area such that no system will be counted if no scattering is taking 
place (that is, when experiments involving only systems of type 1 from 
ensemble W\ 0 or type 2 with ensemble W l 20 are taking place). We say con¬ 
cisely that the surface A# - is not allowed to be placed in the direct beams of 
either system 1 or 2. 

In addition we require that A# - is placed at a great distance from the 
collision domain in order that the normal vector for A#" shall be parallel to 
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the direction of the position vector of the surface r (see Figure 40). By this we 
mean that | r | is large compared to the size of the scattering region. The vectors 
k x , and r (towards A.#) determine three directions of considerable experi¬ 
mental importance. 

We are now interested in the case in which the surface denoted by A,F 
responds only to systems of type 1. Our purpose is to compute the probability 

tT(S(W\ 0 x W‘ 20 )S + (F 1 (^ ) A.F) x 1)) (6.2.1) 

under the assumptions made above. The x product in (6.2.1) refers to the 
representation x ^f 2 and F X {J, AJ) is the effect introduced in 

§6.1 for the detection of systems of type 1. 

In order to use the assumptions previously introduced for W\ 0 ,W l 20 , we 
rewrite S in the form 

S = 1 - 2niT (6.2.2) 

by which we define the operator T. In the absence of the interaction, T would 
be the null operator. 

The assumption that the undisturbed beam does not hit A# - means that 

A#')) « 0, (6.2.3) 

from which it follows that W[ 0 F l x 0. Thus, from the relation tr (AB) = 
tr(BA) it follows that (6.2.1) is transformed into 

(2n) 2 tr(T(W\ 0 x W^T + [F , A.F) x 1]). (6.2.4) 

The operator T has a close relationship with the function g^r/\ r |) introduced 
in (4.7.10). As we have stated in §5.1, Q.e ikf = and Se iii r are identical 
with respect to the outgoing spherical wave. This means that, for large | r \ 

/ r \ e ikr e~ ikr 

— 2niTe ik f xgA--) — + -. (6.2.5) 

\M/ r r 

In (6.2.5) T is to be understood as an operator in -Tfj where occurs in the 
product representation VIII (1.1) of Since S commutes with H, this is also 
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the case for T. In the momentum representation we prefer to express T in the 
form 


<fc'|T|fc> = s(^ - T(k', fc), (6.2.6) 

\2m 2m) 

where T(k', fc) is defined only on the energy surface k' = k. Introducing polar 
coordinates (r is the polar axis) we obtain 

Te* f = jV E ' ? <fc'|T|fc> d 3 k ' 

r'In /•» /*1 /h’2 k2\ 

= j dcp j fc' 2 dk’ j d^dl— - — \T(k’, k), 
where we note that fc' is to be expressed in polar coordinates as follows: 


fc', £ = cos 9, q>. 

Integrating by parts over for large | r\ it follows that 


Te ik r « 2tc 


r® e l 

fc' 2 dk' - 
Jo «; 


ik-yfc' 2 


fc 2 

r \2m 2m 


r|fc'—,fc + 




By comparison with (6.2.5) we therefore find that 




We therefore obtain 




and 



— (2n) 2 mT[ k —, fc 
k 


(6.2.7a) 



2m 


T(k', fc) d 3 k' 


2n e ikr 
— m — 
i r 



(6.2.7b) 


Since T has, with respect to Jf = x .7^, the form 1 x T (where the latter 
does not correspond to the x product in (6.2.4)), it is preferable to express Tin 
terms of the momentum representation with respect to P u P 2 , and to use the 
separation of the center of mass, as described in §3. Using fc 1; fc 2 as the 
variables of the (P u P 2 ) representation, and K, k defined by (3.4) and (3.5), 
we obtain 


K — fcj + fc 2 , 



( 6 . 2 . 8 ) 
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then, from the form 1 x T with respect to JC = x jfj we obtain 

<k\, k' 2 \T\k 1 , k 2 y = 8(K' - K)d(~ - ~y(k', k). (6.2.9) 

Here (6.2.4) takes on the form (using a as the complex conjugate of a): 

( 2 tc ) 2 yk'uk^Tlk^XhlWiolk'i) 

■<k' 2 \Wi 0 \k 2 Xk'{', k' 2 \T\k'ik'i) 

■<k7\Fi\k\)S(k' 2 -k2) 

■ d 3 k 1 d% d 3 k\ d 3 k' 2 d 3 k'[ d 3 k'i d 3 kj" d 3 k'^, (6.2.10a) 


where, according to §6.1 

e ‘(ki-k"o • ii X'k'i ’« 4f 


<l te (i/2m t W'2 -ki 2 )t 


J - 

Ja& 


(6.2.10b) 


(where n = r/| r |) or is equal to 0 if k\ ■ n or kj" • n is negative. 
Using (6.2.9), (6.2.10) takes on the form 


(2n) : 




K)8(K"' - K")d(f- - ~)s( k '" 2 


2m 2ml \ 2m 


IS 2 ) 
2m) 


• T(k', k)T(k"', k")<k 1 1 W{ 0 \k'[}(k 2 \W' 20 \k'i> 
■ <k'i\Fi\Ky8(k' 2 - k 2 ) d 3 K' d 3 k'd 3 K"’ 
■d 3 k"' d 3 k l d 3 k 2 d 3 k\ d 3 k 2 


2n 


2 k 2 
«5{ — - — 
2m 2m 


T(k', k)S 


k'" 2 

2m 


_IS)\ 

2m) 


T(k'", k")</c 1 | lUj 0 |/cj'></c 2 1 Wl 0 \k 2 y 
•</c"'|F 1 | k' x y* - • '2 d 3 k' d 3 k"' 

■ d 3 k l d 3 k 2 d 3 k'[ d 3 k 2 d 3 r 2 , 


(6.2.11a) 


where, on the right-hand side of the last equation, K' is to be replaced by 
K and K'" by K". Since the function 8(k' 2 — k 2 ) occurs in (6.2.10a), we may 
replace 

^-(kf-k?) 

Z.IYI ^ 

in (6.2.10b) by 


1 

2m x 


(K 


nr 2 


> + - *?> - i »" 2 - *”>+ 
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that is, in (6.2.11a) we may replace (k’x\F 1 \k' 1 ') by the value 


m 1 (27i) 3 J f 


fa e (il2m)(k"'2-k'i)t e (i/2M)(K"'i-K'l)t 


f e iCk ' - df, (6.2.11b) 

J\SF 


where we have to set K'" = K". K' = K. 
From (6.2.8) it follows that 


m 


k\ ■ r = k' ■ r H- K ■ r, 

m 2 


k’ -r 2 H- K ■ r 2 . 

m 1 


k 2 ‘r 2 - 

Therefore we obtain 

k\ ■ r + k’ 2 ■ r 2 = k' ■ (r — r 2 ) + K ■ 
and similarly, 

k"; ■ 7 + k!'i • r 2 = k'" • (? - r 2 ) + K 


m 


m 


m _ 

— H- r 2 

m 2 mi 


m _ m _ 

— r H- r 2 

m 2 m 1 


Thus it follows that the integration over k', k'" can be carried out separately. 
Using (6.2.7b), for large | r — r 2 | we obtain for (6.2.1 la) 

„2 




- (i/2m)(k 2 - k" 2 )t - (i/2M)(JC 2 - K" 2 )t 


■ y JW 1 ■ n y/k'i ■ n 


gHk-k”)\t-f 2 \ 

\r~r 2 \ 




•(fcil^tolfciX^l^olfc'a) 

. gi(k- K’)- (mltrnr + mltrnrz) 

■ d 3 k l d 2 k 2 d 3 k'[ d 3 k 2 d 3 r 2 , 
where k\, k’[', k, k" are defined as follows: 


( 6 . 2 . 12 ) 


m 


k'^kp^ + ^K; k7-k"^ 

\r - r 2 \ m 2 \r - r 2 \ m 2 


+ — K" 


k = 


m r m r 

rCj rC 2 

mj m 2 


k" = 


— k'{ - — k!' 2 
m x m 2 


(6.2.13) 


The time dependent exponent can be replaced by the original expression 


-(i/2 mi m 2 -kr 2 )t 


( 6 . 2 . 14 ) 
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Equation (6.2.12) is the general formula for the computation of the proba¬ 
bility for the impact of a system of type 1 on the surface A#' (at the location r) 
in the time interval t 1 to tj. This probability depends, in general, upon the 
“wave packets” (k 1 \ W\ Q \k'{} and (k 2 \ Here precise control 

measurements are required to determine the structure of these wave packets, 
that is, of the particle beams for systems 1 and 2. It is clear that without such 
control measurements for the incoming beams scattering experiments will 
not work. Generally we seek to keep these to a minimum by choosing special 
forms of W\ 0 and W' 20 in order that we may be able to investigate the inter¬ 
action between the systems undergoing the scattering pfocess. We will 
consider only two examples to demonstrate how we may choose special forms 
of the experimentally prepared ensembles W \ 0 , W‘ 2 0 : 

(1) m 2 > m x and we choose the average momentum k 2 = 0. 

(2) the interval t 2 — t 2 is chosen so large (that is, we consider the mathe¬ 
matical limits t x -» — oo, t 2 -* oo) that all the systems striking the 
surface A.W are counted (independent of when they strike). We shall 
now begin with case (1). 

In addition to m 1 /m 2 4 1 we require that k 2 /m 2 4 k l /m l for all k x , k 2 in 
the wave packets. This means not only that the average of the velocities of 
system 2 is, for all practical purpose, zero but also that these velocities have 
very small dispersions (compared to velocities of system 1). Under these 
assumptions, the velocity of the center of mass K/m is, for all practical pur¬ 
poses, equal to zero: K/M 4 k 1 /m 1 . 

From these assumptions it follows directly that 


k « k u k" k k'[. 


P ~ t- r r 2 p" ^ i." 

kl ~ kl W^2 f * 


Using (6.2.14) in (6.2.12), the latter takes on the form 

r‘2 


m 


(2 n) : 


[dt f df\ 

Jt , -I \& -I 


e *(i/2m,)(tf-k'i 2 )'fc0^ . 


r 2 


Jt, -JA3J 
ff i(k-k")\r-t 2 \ 


\r~r 2 \ 


\r ~ r 2 


3 AK^rkMK r ~ r2 


I r - r 2 1 


I r - r 2 1 


r -r 2 

\r-? 2 \‘ 



■ <ki I W\ 0 \k’[')e i(R ' + ' i2 ~ il ‘ i ~ H) ' ((m ' lm2>f+f2) 

■ <^ 2 ! ^ 20 1 ^ 2 ) d 3 k x d 3 k 2 d 3 k" x d 3 k 2 d 3 r 2 . (6.2.15) 


From (6.2.15) it follows that r 2 need only be integrated over a region for 
which \r 2 \ 4, |r|, as we shall soon see. In order to evaluate (6.2.15) more 
precisely, we shall, for \r 2 \ 4 | r | use the following approximation 

\r ~r 2 \~ kl - r 2 -~. 
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From (6.2.13) we obtain the approximation 


k - k" 




m 

m i 


K 


m 

m 2 


k 


ft 

2 


« (fci - K) - 


m i r - r „ _ 

/c 2 ' “I ^2 ' * 0 , 

m 2 m 2 


where e 0 = k° x /k x . Thus, in the exponent we finally obtain 
(k - k")\r - r 2 \ = (k x - k x )r - (k x - k" x )r 2 ■ e 

- - k’f)-e 0 r, (6.2.16) 

m 2 

where e = r/r. 

This better estimate is required for the exponent because r and k l can be 
relatively large. If we use this approximation, then we can easily perform the 
integration over k 2 and k 2 by using the formula 


<?2i*n 0 ki> = 


for the transition from the momentum representation to the position repre¬ 
sentation. Thus from (6.2.15) and using (6.2.16) we obtain 


2; zm x P dt f df f e -w 2 ».x*I-M J » fc o 

J (1 Ja^ J 


a i{k\ -k\)r 


■ m^kjnk'ii k'lKk^w^k'iy 

. gi(k I -k'{)'(t2 + mtlm2f)-i(kt—k'{)e-t 2 


m 


m. 


m, 


m i 


— r + r 2 -e 0 r| W 20 \— r + r 2 - - e 0 r 

m 2 m 2 m 2 m 2 

■ d 3 k x d 3 k' x d 3 r 2 . 


(6.2.17) 


We now introduce the new integration variable 


? 2 = r 2 + ^ ?-^e 0 r (6.2.18) 

m 2 m 2 

and set df = r 1 dm(e), then (6.2.17) is transformed into 
2 nm x Cdt f dco(e) 

•ft i J J 

■ fc?T(M, ki)T(k" x e,K)<ti I W"iol^> 

. gi(^l-fc'l)-(f 5 + ('«l/m2)So'') . e -i(k 1 -ki)(g fi-(m t /m2)f + (m t /m2)e-e 0 i') 

■ <?2 1 ^201 ?2> d 3 k 1 d 3 k'{ dr 2 , (6.2.19) 

where Afi is the solid angle corresponding to the surface AJ^ 
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(r' 2 1 W 20 | r 2 } is the position probability density of the system 2 at “time 
t = 0” if there is no scattering. According to the instructions for the experi¬ 
ment (r 2 \W i 2 0 \r' 2 y is only noticeably different from zero in the collision 
region. In addition r must be large compared to the collision region, that is, 
i r’ 2 1 < r. Thus from (6.2.17) it also follows that | r 2 1 < r, from which it follows 
that the above assumption about r 2 and therefore our approximation 

(6.2.18) are justified. 

Since the momentum in W\ 0 has small dispersion, we may approximately 
set 

(k, - fcj) = (k x - K)e 0 + k xl - K ± , 

where k xl , k xl are the components of k x and k'[ which are orthogonal to Co¬ 
in the case in which T(- • •) does not vary too quickly with the direction of e to 
k, we may set 

T(k x e, k x ) = T(k x e, k x e 0 ), T(k x e, fcj) = T{k'[e, k'[e 0 ) 

(6.2.19) is then transformed into 

27cm 1 /c? V dt f dco(e) [ e m ^ k ' ){r+2{milm2)r ^ (m ' lm) ‘'‘ or+So ' n ~^''^ 

Jti Jasj J 


' e -(il2rm)(kl-k'{ 2 )t 


T(k x e, k x e 0 )T(k" x e, k x e 0 ) 


■ <ki\W[ 0 \k’[}e i(k '- <ri| IT‘ 2 0 1r 2 > 

■ d 2 k xl d 2 k xl d 3 r 2 . (6.2.20) 


Here we may integrate over £ 1X , k xl . To simplify the notation we set 


1 

(In) 2 


I W'i 0 \k'l)e- in ' i ^'<d 2 k 11 d 2 k" xl 


= <k»?u.\W t 1Q \kr u r” 1± >, 


( 6 . 2 . 21 ) 


where r xi and r" xl are the components of r x or r" x which are orthogonal to 
e 0 . Here (6.2.20) is transformed into 


(2n) 3 m x k° x dt dC0(e) \ e ^,-k'0(r + 2( mi / m2 )r--( mi l m 2)e-tor + ^2-i-ti) 

•'An J 

• e~ <ll2m)(k ^~ k ' l2) T(k x e, k x e 0 ) T{k" x e , k" x ej 

■ <k x , r 21 \W\ 0 \k" x , ? 21 ><r 2 1 W‘ 2 0 \r' 2 ydk x dk" x d 3 r' 2 . (6.2.22) 

We obtain a further simplification under the following assumption: If 
(t 2 - t x )(k x /m x ) is large compared to the size of the collision region and 
compared to ( m x /m 2 )r (that is, r is not “too large”), then we may neglect in 
the exponent in (6.2.22) ( e 0 — e) ■ r 2 and the terms in ( m x /m 2 )r . If the wave 
packet <i*x I W[ 0 \ r' x > has the property that, in the “target region” (that is, 
where <j 2 \ W 20 \ r 2 ) is essentially different from zero) the quantity 

(.ki, r 2X _ \ W' xo \k'[, r 21 y 
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does not, for all practical purposes, depend upon r 2 ±, then we may integrate 
over r' 2 and, using 

j<?2\W i 2o\r r 2>d 3 ?2 = tx(W t 20 )=l 

obtain the following expression for the probability of a system striking 
in the time interval t 1 to t 2 : 

(2n) 3 m 1 k° 1 dt f dco(e) fc®* 1 “-*i" 2 )»] 

Jti Jan J 


T(k 1 e 1 k 1 e 0 )T(k'ie, k’[e 0 ) 
{k u 0\W\ o \k'[,0) dk t dk'[. 


(6.2.23) 


The assumption that (k u r' 21 | W' 10 \k" u r 2 ±} does not, for all practical 
purposes, depend upon r' 21 in the target region is the so-called postulate of 
“homogeneous beam density for system 1”. To see what the notion of a 
homogeneous beam density means we now compute the probability for 
the impact of particles of type 1 on a surface perpendicular to e 0 at the 
location r-e 0 = x (that is, for the case in which there is no interaction with 
particles of type 2). Then, from (6.1.13) we obtain the probability (using the 
same approximation) 


k°i 

m 1 (27t) 3 


y J dt J d 2 rl je mktl 

<k x \W[ 0 \k'l) d% d 2 k’[ 

s f 2 f d 2 rl 

m x 2n J h Jj, J 


- fti'x) r + (III - k’i)x - (X/2m)(fc, - k'{ 2 )t] 


)x — (l/2mi)(k 2 k'{ 2 )t] 


< k l ,r 1 \ W\ 0 \k’[, r L } dk x dk'{. 


(6.2.24) 


If, for the location of the surface 3F, we set x = 0 and consider (6.2.24) for 
different small surfaces A# - in the target region, from (6.2.24) we obtain 

k°AF r‘ 2 C 

™ J dt J g-<W».x*f-*r*» <fci> r x | W\ 0 \k'[, r x > dk, dk'[, (6.2.25) 

where AT is the surface area of AJ*j and r L is the location of A#! The proba¬ 
bility (6.2.25) is constant if </c x , r x | W[ 0 \k'[, r ± ) does not significantly vary 
with r x within the target region, that is, if we may replace 

<,kyJ lL \W\ 0 \k" u r lL ) 

by (k u 0| W\ 0 \k'{, 0>. This is the meaning of a homogeneous beam. From 
(6.2.24), in the same approximation for x = r we obtain 

k°AF (“ 2 r 

-i— dt e'tffti—fc'i )»•—(i/2rMi) ( fcf—ic? 2 )t <^ 1 0\W\ o \k'[, 0> dk x dk" x . (6.2.26) 

2nm ! J fl J 
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The equation (6.2.26) describes a comparison experiment to (6.2.23) in which 
we place the surface A# - directly in the beam path of system 1 in the absence 
of the system 2 beam, that is, without scattering. Here it is interesting to 
determine whether the probability for time intervals in the case (6.2.23) is 
essentially delayed in time compared with the case (6.2.26); see, for example 
XVII, §6.5. 

For tj -> — oo, t 2 -* oo, (6.2.22) is transformed into 

(2b)V*i J a£j date) J dk x | T(k x e, Mo) 1 2 js(J±- - (6.2.27) 

■ (k u o\w[ 0 \k'ioy dk'{ 

and from (6.2.25) we obtain 

k°\F r / k 2 k ,,2 \ 

(62 ' 28) 

The formulas (6.2.27), (6.2.28) are the basis for the computation of the 
experimentally important notion of the “interaction cross section” (see §6.3). 

We shall now consider case (2): We further assume that T(ke, ke 0 ) not only 
varies slowly with e and e 0 , but does not vary rapidly with k. This is not the 
case when there are sharp “resonances”, that is, when there exist “decaying 
states” (see XVII, §6.5) with long lifetimes. For -> — oo and t 2 -> oo, 
(6.2.12) takes on the form 

e ~ i[(l1 2m)(k 2 ~ k" 2 ) + (11 2M)(K 2 — K" 2 )]t 



_ e 


(k[° ■ n) 


i(k-k")\r-f 2 \ 


\r~r 2 1 


Tlk 0 


r ~ r 2 ti 


\r~r 2 \ 


k° 


• | W\ 0 | fcj + > l 2 -k'i-k’i)- {{mjm 2 )f + (m/m! )t 2 ) 


where 


• <£ 2 1 ^ 20 ^ 2 ) d% d% d 3 k'{ d 3 k'{ d%, 


kO=™k%--k°, 

m 2 


k; 0 = k° 


k-r 2 | m 2 


(6.2.29) 


(6.2.30) 


We will evaluate (6.2.29) for two subcases: 

(a) k° 2 = — k° u that is, the center of mass is, on average, at rest. This case 
can be realized, for example, by using colliding beams. 

(fi) k 2 = 0, that is, system 2 is at rest (on average) before the collision; 
system 2 as target for the system 1; this case is called the collision in 
the laboratory system. 
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For the case (Ji) we have already found an expression (6.2.27) using the 
assumption mjm 2 < 1. 

In case (a) from fc 2 = — and (6.2.30) it follows that 

K° = 0, k\° = k\ = k'°. (6.2.31) 

\r~r 2 1 


As long as the momentum distributions of systems 1 and 2 do not have high 
dispersion, we may expand the expressions in the exponent in (6.2.29) into 
linear terms in the differences k 1 — k° u k 2 — k 2 , etc. From (6.2.29) it follows 
that since K 2 - K" 2 is quadratic in the deviations: 


m 


(2n ) 2 m 


- 1 *! f 

1 J |_ J - 




k\° • n 
I r- r 2 1 2 




(k x | W \ 0 1 kl > <fc 2 1 W 20 I k 2 y 


• e i(E l +J 2 -Sl-Ej).((»/» 2 )f + (m/m 1 )f 2 ) d 3 jJ i ^ 3 ^ rf 3 £„ ^ 


(6.2.32) 


Since the integral over the time is proportional to <5(/c — k"), we have set the 
factor = i 

In accord with our assumptions, we may approximately set 


k 2 - k" 1 = 2k\ 


— (fei - k'{) -e 0 - (k 2 - k" 2 ) ■ e 0 

\jn ] in 2 


. (6.2.33) 


Thus, in (6.2.32) we may integrate over d 3 k l d 3 k 2 d 3 k'[ d 3 k 2 ; we obtain 
m 2 (2n)’ 


J An J I r - r 2 \ 2 \ \r - r 2 \ J 


m _ m _ fc? _ 
— r + — r 2 - e 0 t 


r — r 2 
W l 


m 2 

m 

m? 


m 


m 


10 


m m _ fe? _ 

— r H-r 2 - 

m 2 Wj 


_ m _ fc? _ 
r H-r 2 H - Sq t 


m, 


m. 


IF,' 


20 


m _ m ^ fc? _ \ , 

— y + — r 2 + — ) w r 2 wt. 

m 2 mj 


According to the assumptions the quantities 


ri = — r + 
m 2 


m _ _ 

r 2 Co f 

m x m x 


and 


m _ m _ fc? _ 

r 2 = — r + — r 2 -\ - e 0 t 

m 2 m x 


are of the order of magnitude of the size of the collision region in order that 
neither \ W\ 0 \ r \) or <r 2 1 IF 20 \ r ' 2 > are equal to zero. Thus it follows that 


k\ _ k°, _ 

— e 0 t, — e 0 t and 
m x m 2 m. 


m _ m _ 

H-r 2 

m, 
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must also be of the same order of magnitude. For r large compared to the 
size of the collision region we therefore find 


_ m 2 _ 

r 7 ~- r 

m 2 


so that we may replace 


r - r 7 


by 


1 


(1 + m 1 /m 1 ) 2 r 2 


and 


r ~ r 2 , r def _ 

T =—=-7 by - = e. 

\r~r 2 1 r 

On the basis of these considerations we may, instead of (6.2.32), also write: 
m 2 k° 


f c c° 


" (i/ 2 m)(fc 2 — k" 2 )t fa 


(6.2.34) 


(1 + mjm 2 ) 2 m 

■\nm k 0 1 e 0 )\ 2 (k 1 1 W \ 0 1 k'l y(li 2 \ W 20 \ H 2 y 

. g «£,+fe-tT-e)-((m/M2)»+(m/»,)» 2 ) <i 3jf i 

Integration over r 2 yields a S function. For this reason we may use the 
following approximation instead of (6.2.33) 

k 2 - k" 2 = 2k°(k 1 - k'[) -e 0 = k 2 - k'[ 2 . 

From (6.2.34) we therefore obtain the new expression: 

m 2 fc° 


h2n date) J (k 2 - fc" 2 )j | T(k% k%) | 2 


(1 + mjm 2 ) 2 m 1 

' <^i I W\ 0 1 fcj><^ 2 1 W 20 1 k 2 > 

gi(t| +k 2 -ki'-k"2) ((m/m 2 )r + (m/mi)?2) d, 3 k y d 3 k 2 d 3 k'[ d 3 k ' 2 d 3 r 2 

We may now carry out the integrations over k 2 and k 2 . Together with 

< 5 ; «-*?>) 

we finally obtain 

dco(e) J<5^ {k\ - kf)) | Tk% k?e 0 l 2 <fci I ^iol^ 


pi( fc 1 - fc j) ■ ((m/m2 )r + (m/m 1 )r 2 ) 


m 


m 


— r H- r 2 


m. 


m, 


W! 


20 


m _ m 


m 2 


m, 


H-r 2 ) d 3 k x d 3 k'( d 3 r 2 . 
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Similarly as in the evaluation of (6.2.20) it follows that the impact probability 
will be given by 

dco(e) | d(~ {k\ - k f | T(k\e, k%)\ 2 

• <k» fiMolku ?2±><j2\W i 20 \? 2 )dk 1 dk'[ d% 


and, under the same assumptions, since < k u r 21 1 W[ 0 \ k t r ' 21 > does not, for all 
practical purposes, significantly vary with r 21 in the target region, we may 
integrate over r 2 . We obtain 


m 2 k\(ln) A 

m, 


f da)(e)\T(k 0 1 e,k° 1 e 0 )\ 2 
■Jaci 



(k u 0| W\ 0 \k'l, 0) dk 2 dk'[. 


(6.2.35) 


We now consider case ( ji): Since k 2 = 0, from (6.2.30) it follows that 


n° = -k%, 

m 2 


k'? = k°{^-+™k<{e 0 . 
\r-r 2 1 m 2 


(6.2.36) 


In the time dependent exponent in (6.2.29) we may write the energy difference 
in the form 


Since k 2 = 0, and since we consider only linear terms in k 2 ; we may neglect 
k\ — k 2 2 . Equation (6.2.29) then can be written as follows 


m 


2 


2nm l 



gi(k — k")\r— r 2 \ 





<k 1 \W t 10 \K> 


■ <h\W‘ 20 \k'i} 


. e m+lt2-k'{-k"2)-((mlm 2 )r + (mlm 1 )r 2 ) d S]l i ^ ^ ^ ((,231) 

Including linear terms in it follows that 


YYl 

k-k" = - — e 0 -(k 2 -K), 
m 2 
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where we have set k'{ = k, because of the S function. Thus (6.2.37) is trans¬ 
formed into 




(kj - k" 2 ) 


T k 1 


.0 r ~ r 2 


r 2 


|,£°) 


k'° ■ n 

\r~r 2 \ 2 


. gi(ki-k'i)-((mlm2)f + im/mi)f2) ~ k'i) ■ {(ml m2)r + (m/mi)r 2 - (m/m2)eo|r-hi) 

• <Zi\W\ 0 \k'lXk2 1 ^ 20 ^ 2 ) d 3 k x d% d 3 k\ d 3 fy d%. 


The integrations over k 2 , k 2 may be carried out as follows: 


k'i ■ n 
I r - r 2 \ 

m _ m _ 

— r -1- r 2 


e i(U-k'{)((mlm 2 )f+(mlmi)f2)(k^ | W\ 0 \ k"^} 


c 0 |r - r 2 11 Wiol — r + — r 2 - — e 0 k - ^2 


m 2 ntj 


m 2 


i m 2 m x m 2 
■d%d 3 k'[d%. (6.2.38) 

Instead of r 2 we introduce the following new integration variable 

m 


_ m _ m _ 

f' 2 = — f + — r 2 - e 0 \r - r 2 \. 

m 2 m x m 2 


(6.2.39) 


Thus (6.2.28) is transformed into 
m 2 (2n) 2 


— f df\s(^-(.kl-k?) 

m x Jap J \2m 1 


Tlk°~ — V J~, k 0 ' 


\r~r 2 1 


t' 0 • i? - - 

1 _ J(k 1 + 


|r - r 2 | 




d? 0 


dr 


-1 


d 3 k 1 d 3 k'[ d 3 r 2 , (6.2.40) 


where r 2 is considered to be a function of r' 2 and | dr’ 2 /dr 2 1 is the functional 
determinant of the transformation (6.2.39). Since r 2 is only of the order of 
magnitude of the size of the target region, from (6.2.39) we obtain the following 
approximation 


m _ 
— r + 
m 2 


m _ m „ . 

— r 2 -e 0 |r - r 2 » 0 

m x m 2 


(6.2.41) 


which suffices as an approximation for r 2 in | r — r 2 \ 2 . In the same way we 
may compute kf with this approximation according to (6.2.30). We will show 
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that k'x has the same direction as r. For simplification we introduce the 
following notation 


r - r 2 _ 

e = 73 — 3-7 and e, 
r - r 2 


r 



From (6.2.36) it follows that 

to = k?J—e + — e 0 ). (6.2.42) 

\m i m 2 ) 

From (6.2.41) it follows that 

_ , / m _ m ^ \ 

re 1 = \r 1 -r 2 \\—e + — e 0 . (6.2.43) 

\ m i ™ 2 ) 

Thus it follows that kf has the direction of e x = n, so that in (6.2.40) we may 
replace k' x ■ n by kf. The functional determinant is given by 


d ? 2 


dr 2 



(6.2.44) 


As earlier, we may forego integration over k x ,k’[ in (6.2.40), in which we use the 
d function to obtain 


m 


(k i - *?) ■ ^2 + — eo\r ~ r 2 \J = (k 1± - kl L )-r 21 . 

Thus, under the same assumptions about <k' x , r 2 ±| ^ 20 1^ 21 ) we obtain 
the impact probability 


m Wf df 


k\° 


T( — k°e, — k x e 0 
1 m x m x 


r — r 2 \ 2 ( m N 3 


m. 


"* \ 1 , m. 

— 1 + -e 0 -e 


m. 


J~~ °l ^ro|k'(, 0> dk x dk\. (6.2.45) 


From (6.2.42) and (6.2.43) it follows that 

12 


r 2 — \r — r 2 \ 2 


m m _ 
e H-e 0 


m 


m 2 


k\° = k°x 


m 

m. 


m 


e H- e 0 

m 2 


(6.2.45) therefore takes on the following form: 

12 


m 


t fe?( 27 tr r 

m, J ^ 


m ^ m _ 

— e x H-e 0 

m x m 2 


do(e 1 ) / \ 3 / \ 

“ f—) (l + 

mi/ \ m 2 J 




m, 


mi 


1 


<5(^ (k? - kf)) <ki, 0| VF) 0 |k'(, 0> dk x dk’[. 


(6.2.46) 
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Equation (6.2.43) fixes the “auxiliary” direction e in terms of the experi¬ 
mentally determined measurement directions and e 0 ; using the abbrevia¬ 
tion A = r/1 r — r 2 | we obtain 


m _ m _ 

Ae, — — e - e 0 and 

m l m 2 


- 2 , m _ m _ 

A = I — e H-e, 

\m, m 


«) 2 


(6.2.47) 


Let 6 denote the experimental value of the scattering angle. We therefore 
obtain e x ■e 0 = cos 9. The auxiliary angle Sis introduced as the angle between 
e and e 0 , that is, e ■ e 0 = cos 9. Thus from (6.2.47) it follows that 


m _ m _ 

— e H-e 0 

m 2 


m . . m, 

— 1 + 2 —i cos 9 + 
m x m 2 



1/2 


Therefore (6.2.46) can also be written in the form 


f Me t ) 

m i Jan 


1 + 2 ^ cos 9 + ( — 
m 2 


/mA 2 

\m 2 ) _ 


3/2 


1 + — cos 9 
m 2 


T(— k ( }e, k(e 0 
m x mj 


1 


^2^(M - fc'i 2 ) )</c l5 0| W' 10 |fe x , 0> dk,dk![- 


(6.2.48) 


According to (6.2.47) the scattering angle 6 is obtained from the auxiliary 
angle 9 as follows 


tan 6 = 


sin 9 

m 1 /m 2 + cos 9 


(6.2.49) 


Here we have provided detailed derivations of several formulas for the 
impact probability of systems on surfaces for scattering experiments in order 
to illustrate the following problems. 

First, it is important to realize that the mathematically idealized limiting 
process described in §§2 to 5 for the comparison between theory and experi¬ 
ment must be reduced to its actual physical meaning, that is, to finite times 
and finite distances. The necessary comparisons between the idealized 
approximation of the scattering operator and the exact behavior of the wave 
packet cannot be carried out because of the level of difficulty. Because of the 
small range of the interaction, the idealized approximation obtained by 
using the scattering operator is very good. 

Second, it is important to realize that it is not sufficient to carry out an 
arbitrarily designed scattering experiment, because implicit in the formulas 
is the structure of the ensembles W\ 0 , W 20 . We must, in addition, carry out 
additional experiments in order to obtain information concerning the nature 
of the beams W\ 0 and W^q of systems 1 and 2, respectively. We must there¬ 
fore always assume that experimentally determined facts must be included in 
the mathematical form of the theory, as we have discussed at the beginning 
of this section and expressed as the so-called assumptions about W\ 0 , w‘ 20 . 
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A third point is evident in the derivations: In order to use simple formulas 
such as (6.2.24), (6.2.27), (6.2.35) and (6.2.48), additional approximation 
steps are necessary in which, again, because of the level of difficulty, we must 
omit even rough estimates of the accuracy of the approximation. In particular, 
it is evident that the ratio of the distances of the detection surfaces to the size 
of the collision region is more important than the range of the interaction. 
However, the concept of the size of the collision region is already an 
“approximation”, because every wave packet is, for almost all times, mathe¬ 
matically extended to infinity. In this relationship, the dispersion of the wave 
packet (see X, §§1 and 2) is such that some of the above formulas cannot be 
correct for “arbitrarily large” distances. 

The previously mentioned estimates of the approximation are described in 
experimental physics by the use of the catchword “error estimate”. For con- 
ceptial clarity, we emphasize the fact that experiments have, in themselves, 
no “errors” since they are the real facts. “Errors” refer exclusively to the 
comparison between experiments and mathematically deduced formulas. 
Such “errors” may have three different origins. First, every mathematical 
theory, that is, every JiT from a TT (see I) represents reality imperfectly. 
Second, real experiments are, in general, not the experiments which one 
“wanted” to perform in order to test a mathematical relation, which was 
derived in JIT. Third, the relationships derived in JIT are often approxi¬ 
mations to an exact relation which is too difficult to derive. The first two 
points cannot be easily distinguished, since the mapping principles {JUT) 
(see I) for experimental facts which cannot be represented in digital terms 
require the use of imprecision sets (see [8], §6 and both [6], II and [6], III, §5). 
The third point is not concerned with the conceptual structure of a TT, but 
only an often used mathematical approximation procedure, which just 
permits us to apply a JIT to a reality domain iV. 

If the complex structure of the question of the comparison between theory 
and experiment were made clear, then we would understand, that we would 
prefer, for the most part, to apply methods which are not very precise but 
would lead more quickly to the results which we shall obtain in the following 
section. 

In closing this section we shall extend the results obtained above by 
showing how the formulas for the “impact of the microsystems after 
scattering” have to be changed for the case of two “identical” systems. First, 
we must consider how (6.2.1) is to be changed since the apparatus symbolized 
by A T cannot distinguish between the two “identical” systems 1 and 2. 

The purely “symbolic” introduction of the map T 0 in VIII (3.5), VIII (3.7) 
is of little help because we have to consider the question of the response of a 
concrete apparatus. 

Since two systems can occur, we consider the more general case of an 
apparatus with two detection surfaces AT a and AT b . We then obtain the 
following coexistent effect processes: In the time interval J a a system strikes 
AT a ; in the time interval J b a system strikes AT b , and all logical combina¬ 
tions of the latter. 
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We now consider the following logical combination: In the time interval 
J a a system strikes AJ^ AND in the time interval J b a system strikes AJ b , 
where AJ a and A J b are two nonoverlapping portions of a spherical surface 
centered on the scattering region. To this effect we assign the effect 

F{J a , A.FJ x F(J b , AJD + n't, x W,, AJ£). (6.2.50) 
(6.2.50) is an effect because F(J a , A.FJ and F(J b . A.F b ) cannot simultaneously 
occur, that is, since 

F(J a , Ajg + F(J b , AJ|) < 1. 

It is somewhat more difficult to discuss the case in which A.¥ a = Athat is, 
only one detction surface is present A:¥ = A2F a = A3F b . If we assume that the 
apparatus characterized by A.F has sufficiently high time resolution that both 
systems can be separately counted, then from J a n J b = $ we will obtain 
the same formula as (6.2.50) where AAF = A,F a = A,¥ b . The case that both 
systems strike the surface A S' simultaneously can, for sufficiently small A#; 
for all practical purposes, be excluded. Thus we do not have to consider the 
form of such an effect for such a simultaneous impact on a small surface ASF. 

For the case of multi-channel scattering theory (see, for example, §5.3) in 
which one of the two systems is bound on the nucleus after scattering, the 
probability for (6.2.50) will be zero. Thus only one of the two systems will 
strike the surface ASF. What is the effect operator to be used to describe this 
impact? 

In complete analogy to (6.2.50) we shall select 

F(./, A;F) x 1 + 1 x F(J, AJF) (6.2.51) 

as the effect of the impact of a system on the surface A,¥ in the time 
interval For the ensemble used above (one system remains bound) the 
operator (6.2.51) acts like an effect because F{J, A.F) x F(J, A# - ) has 
expectation value 0. If we would like an effect operator which can also be used 
for the case in which both systems are not bound after scattering we may (for 
sufficiently small A.F) select 

F(J, A#) x (1 - F(J, A.F)) + (1 - F{J, A.F)) x F(J, AjF) 

+ F(J, A#') x F(./, A#") 

= F(J, A,F) x 1 + 1 x F(./. A.F) - F(J, A#") x F(./, A.F), (6.2.52) 
which can be replaced by (6.2.51) if the expectation value of 

F(./, A.F) x F(./, A#') 

is zero. 


6.3 The Scattering Cross Section 

One of the most important experimentally measurable quantities in scattering 
theory is the so-called scattering cross section. This quantity is appropriate 
for experiments which are configured in such a way that the application of 
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the formulas (6.2.27) or (6.2.35) or (6.2.48) is permissible. Then the applica¬ 
tion of the formula (6.2.28) for test experiments with respect to W\ 0 will be 
appropriate. 

The following expression follows directly from (6.2.28): 

~ 0| W\ 0 \k’[, 0> dk 1 dk'{ (6.3.1) 

and is called the “beam density” for the ensemble. (6.3.1) is the probability 
density that a system of type 1 will impinge on the surface normalized with 
respect to a unit surface area. 

We now consider case (1), that is, m 2 > m 1 . The differential cross section 
da is the ratio of the quantities defined by (6.2.27) and (6.3.1), that is, 

da(e) = (2n) A m\ \ T(k\e, k° { e 0 )\ 2 da>(e), (6.3.2) 

where in (6.2.27) we have assumed that | T(k°{e, k^e^ | is so weakly dependent 
on k x and that (k u 0 \ W 20 \ku 0> has such small dispersion in k u that 
| T(k 1 e l , k{e 0 )\ 2 , upon integration over can be replaced by | T(k° 1 e, k?e 0 )| 2 . 
Then, using (6.2.7&) we may rewrite (6.3.2) as follows: 

da(e) = \g k °? 0 (e)\ 2 dco(e). (6.3.3) 


Using the special notation f k (6) for g$e) in (5.1.6), (6.3.3) is transformed into 

da(e) = \f k «(6)\ 2 da)(e), (6.3.4) 

where 9 is the “scattering angle”—the angle between e and e 0 . 

For (6.2.35), that is, for “collisions in the center of mass system ( K° = 0)” 
we obtain the same result: 

da(e) = m 2 (2n) A \T(k.° x e, /c°e 0 )| 2 dco(e). 


Using (6.2.7a) we may rewrite this expression as follows 

Me) = 1 9k o, e 0 (e) 1 2 d<o(e) (6.3.5) 

or (6.3.4). The reduced mass m no longer appears in da(e )! 

For the case in which systems 2 are at rest before the collision (6.2.27) is 
a good approximation for (6.2.48) only for the case in which m 2 > m If this 
is not the case, then from (6.2.48) it follows that 


da(e j) = m 2 (2n) A 


1 + 2 — cos 9 + 
m 2 



3/2 


1 + — cos 9 
m 2 


x 


T[ — kVeAkVeo 

\rrii m 1 


2 

da)(e x ) 
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from which, using (6.2.7a) and (5.1.6), we obtain 


da(e t ) = 


1 + 2 — cos 3 + (— 
m 2 


( m iY 

\m 2 ) 


3/2 


1 + — cos 9 
m 2 


\ f m k<!($)\ 2 do^i), (6.3.6) 


where the angle 9 is related to the scattering angle 9(9 is the angle between 
e 0 and e 1 !) by the formula (6.2.49). 

These formulas may easily be applied to the examples in §5; for this reason 
we shall not consider (6.3.6) any further, except to note that it is of great 
importance for experiments. We have derived it here because of its principle 
importance in illuminating the conceptual foundations and the approxima¬ 
tion schemes which are required in order to obtain (6.3.6). We now return to 
(6.3.4) in which we need only determine | f k (9) | 2 . 

From (5.1.11) it follows that 

IA(0)l 2 = £ 

l = 0,l' = 0 K 


■ {-i) l - l ' e - i(nl2W - l \e 2id,(k) - l)(e' 2idl ™ - i). (6.3.7) 

This formula is practical only in cases in which it suffices to consider only a 
small number of terms in l and /', that is, 3 t (k) * 0 for large values of /. The / 
value for which S v (k) a 0 for t > l can be obtained with the aid of the range 
of the interaction R as follows: For 

/' kR we obtain S v (k) ~ 0. (6.3.8) 

For the special case in which kR <£ 1, then one term / = 0 suffices and we 
obtain 


\fk(6)\ 2 = £2 s in 2 d 0 (k). 


(6.3.9) 


In this case the differential cross section is isotropic. 
The “total cross section” is defined as follows: 


= jda(e). 


(6.3.10) 


From (6.3.7) it follows that, from the orthogonality of the spherical harmonics: 

47T 00 

a = ^ T (21 + 1) sin 2 S,(k). (6.3.11) 


( = 0 


From (5.2.2) it follows that the differential cross section for the scattering 
of an electron by an atomic nucleus is given by 


da = 


2 \ 2 


1 


mZe 


Ik 2 ) sin 4 (9/2) 


da>. 


(6.3.12) 
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Here it is important to note that the Rutherford scattering formula (6.3.12) is 
not an approximation, even though (5.2.2) was obtained by the use of the 
Born approximation. It is also possible to obtain (6.3.12) by using the exact 
eigenfunctions of the continuous spectrum given in XI, §5 (see, for example, 
[21]). The same result may also be obtained from classical mechanics (see, 
for example, [2], XI, §1.1). These results are a consequence of the special 
properties of the Coulomb potential. 

From (6.3.12) it follows that the total scattering cross section a = oo. We 
note, however, that (6.3.12) is not meaningful for small 6 in actual experiments, 
because the actual experiments are carried out by using apparatuses which 
have finite dimensions. In order to test (6.3.12) for very small angles, we 
would need a completely empty(!) space of thousands of kilometers between 
the preparation and registration apparatus. For the case of a Coulomb field 
the total cross section is not a characteristic of the Coulomb field but of the 
structure of the actual experimental targets. 

In closing we now pay attention to the results of §5.3. From (5.3.4) it 
follows that for W = Q_ JFoQ± : 

W = i£Pn_^ + xPo + Pci <i> + x ^i]- (6.3.13) 

Thus, using <\>t£ri,r 2 ) defined according to (5.3.5), where we use k as an 
index, we obtain 


and 


Q_iA + 


1 



[<Mu, r 2 ) + 4 >g(/ 2 , rj] d 3 k 


= (2^/^ 71 ?z) “ u?2 ’ " i)] (6314) 

Let Fj denote the effect of the impact of a microsystem on a surface given in 
§6.2. Then, from (6.2.52) and (6.3.13) it follows that 


t^VF^Fj x 1) + (1 x F 2 ) - F x x F 2 ] x 1) 

= |[<£2_iA + ,(F 1 x 1)Q_.A + > + 3 <Q_iA^(F 1 x l)f2-«A->]- (6.3.15) 

In (6.3.15) we have used the fact that F^fo) = 0 (see also (6.2.51)) since 
the effect F, is measured for large | r l |. From (6.3.15), using the approxima- 
mation (5.3.16) together with the asymptotic forms (5.3.13) for 

xO") = <^i> 0, 0; r|</>>, 

and similarly, (5.3.19) for rj(r) we obtain the differential cross section for 
elastic scattering: 

d<r(Z) = l\f k (6)\ 2 + \h k (9)\ 2 - HMOMO) + mhjem date). (6.3.16) 
Using (5.3.20), (5.3.23) we may compute an approximation of (6.3.16). 
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The amplitude h k (6) describes an “exchange effect” for the pair of electrons. 
If | h k (9) | | f k (0) | then this effect can be neglected, and we obtain the 

approximation 

dcr(e) = \m\ 2 dcD(e), (6.3.17) 

that is, the same result we have obtained in (6.3.4). The cross section for the 
scattering of “slow” electrons on atoms (a similar derivation to that in §5.3 
holds also for atoms other than hydrogen atoms) is therefore similar to a 
potential scattering. From the formula (6.3.9) it follows that the total cross 
section for very slow electrons can “accidently” be very small for certain 
energy values, that is, if sin 5 0 (k ) « 0 (see, for example, [22]). 

From the asymptotic form of the function <£„, /, m; r|</>> defined in §5.3 

<£„,/, m;r|<£> & f k , n ,Je )-—, (6.3.18) 

r 

noting that (5.3.24) represents an approximation of (6.3.18), we may compute 
the total cross section er„ for the excitation of the energy level E n as follows: 

= T f X X \fk,nim(e)\ 2 dco(e). (6.3.19) 

K J 1 = 0 m= —l 

In order to measure this cross section we need to be able to experimentally 
distinguish between the different energies k 2 /2m, k 2 /2m (n = 2, 3,...), a topic 
which we shall consider in the following section and in XVII, §6.2. 


7 Survey of Other Problems in Scattering Theory 

In this section we shall only present a brief overview of other problems in 
scattering theory. The variety of different scattering experiments for which 
quantum mechanics provides a precise description (providing the corres¬ 
ponding mathematical problems can be solved to reasonable accuracy) is vast. 
All possible atoms and molecules can be scattered against each other. In 
addition to two-body problems it is also possible to consider three-body 
problems. 

While it was possible to present a brief overview of the structure of 
“bound” atoms and molecules in XI-XV, such an overview of scattering 
problems is not possible without the need for an additional book, the scope 
of which would be far larger than the present book. Why is there a disparity 
between the ability, on the one hand, to describe the structure of atoms and 
molecules, and on the other, to describe the scattering process? 

The difference lies primarily in the fact that the structure of atoms and 
molecules is, for the most part, determined by the discrete spectrum of the 
Hamiltonian operators for the different system types; in particular, the 
structure of a molecule is often determined by the behavior of the ground 
states of the Hamiltonians. The discrete character of the energy levels makes 
it possible to make qualitative statements about their structure based upon 
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symmetry considerations and by performing an imaginary “continuous 
variation of parameters.” We have already considered these in XI to XV. 

For the case of the continuous spectrum the situation is completely 
different. The spectrum is continuous, and, as such, has no structure (except 
for degree of degeneracy [which is always infinite] and the decomposition of 
the spectrum into absolutely continuous and singular-continuous com¬ 
ponents). The physical significance of the continuous spectrum has to be 
determined from considerations other than the “possible” energy eigen¬ 
values. The physical significance is associated with questions which arise in 
scattering theory. It is, of course, possible to obtain general statements about 
the structure of the wave operator and the scattering operator on the basis 
of the symmetry of the Hamiltonian operator; in §5.1 we have considered 
examples of such. Readers who are interested in the implications of sym¬ 
metry for the case of scattering theory are referred to the specialized 
literature [23], 

Symmetry considerations can be very useful for scattering theory because 
they often reduce the work necessary in solving problems and, for many 
applications, there are important symmetries associated with cross sections. 
In addition to rotational symmetry, reflection and time-reversal symmetries 
play an important role (see, for example, [23]). 

Symmetry considerations, on the other hand, provide no insights into the 
structure of the scattering process, neither in principle or quantitatively. This 
has been shown in the general statement of the problem in §4 and the survey 
of particular scattering problems in §5. 

We now come to an additional distinction—the measurement methods 
needed in scattering experiments. The discrete terms of the Hamiltonian 
operator determine the discrete frequencies of the emitted radiation in the 
simplest way—and already provide experimentally provable results without 
making it necessary to provide a precise analysis of the function of a spectral 
apparatus. With the aid of simple arguments it is possible to obtain theoretical 
expressions for the relative intensities of the spectral lines (see XI, §1). In the 
case of the scattering process we must give the measurement process a great 
deal of thought, as we have done in §§6.1 and 6.2. In fact, the registration 
processes of “striking a surface at time t” discussed are not the only ones used 
by experimental physicists. For example, the surface considered in §6.1 can be 
experimentally achieved by constructing a hole, through which the system 
enters into a complicated apparatus. This apparatus may be such that it is 
possible to measure the energy of the incoming systems with, for example, 
the aid of fields (see XVII, §6.2). For such a registration there is the problem 
that there does not exist a coexistent registration procedure which can resolve 
both times of incidence and energies to arbitrary accuracy. Often this fact is 
expressed in terms of a Heisenberg uncertainty relation for energy and time: 

AtAE>i (7.1) 

Although (7.1) appears to have the same form as the uncertainty relation 
between position and momentum (see IV, (8.3.18)), there are essential 
differences. 
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First, we note that time is not a scale observable like position and 
momentum, a fact which is physically reasonable but is often (at first) 
disturbing to many people (see VII). Second, the relation IV (8.3.18) refers to 
the preparation, because it represents the dispersion of position and 
momentum in all possible ensembles, as we have discussed in IV, §8.3. Of 
course, a relationship which is similar to IV (8.3.18) must hold also for 
“measurement errors”, and in IV, §8.3 we have interpreted them in this way. 

We may also interpret (7.1) as an assertion about the preparation process, 
that is, about the ensemble. Then we must define what we mean by At and 
A E more precisely. The probability for striking a surface at time t can only be 
meaningfully defined with the help of (6.1.13) for free moving systems. For 
W in the K representation we will therefore define 


E = f (k\W\k) d z k and (A E)‘ 
J 2m 




<£|W|fc> d 3 k. 


(7.2) 


Similarly, for F defined by (6.1.13), we may introduce At. In order to be able 
to describe these simply, we introduce the following projection operators. 
First, in the K representation we define 


P + <k#> 


f<£|^> if k 1 > 0, 
(0 iffcjCO. 


(7.3) 


We define a mapping of P+(k\ ij/} on t](t, x 2 , x 3 ) by 
r](t, x 2 ,x 3 ) 

= — 1 —. C dk 1 r dk 2 f dk 3s /kle i(k2X2+k3X * )(i/2m)k2t <£|(A>- (7.4) 

yj YYli^l.Tiy’ J 0 v —oo J —oo 


It then follows that 


/•OO /»00 /»00 

dt dx 2 dx 3 \rj(t, x 2 , x 3 )\ 2 

J — 00 J — OS j — 00 

= Cdk, r dk 2 r dk 3 \(k\^y\ 2 . 

JO J — oo J — oo 


We define the projection operator Q y as follows 
Q**l(t,x 2 ,x 3 ) = 


ri(t,x 2 ,x 3 ) if (x 2 , x 3 )e 2F, 
0 otherwise. 


(7.5) 


(7.6) 


Let A be the multiplication operator (with t) applied to r\ according to (7.4). 
We define the following, on the basis of (6.1.13) in the following obvious way 

l = a~ l \x(Q,P+WP+Q,A), (7.7) 

(At) 2 = a” 1 tr(Q^P + WP + QAA - tl) 2 ), (7.8) 

where the normalization factor a is given by 

a = tr(Q^P + WP + Qr). (7.9) 
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A relationship of the form (7.1) can be anticipated only if, in the definition 
of A E in (7.2) we replace the ensemble W by the ensemble 

ff = a~ l Q,P + WP + Q, (7.10) 

because not all of W (but only W) contributes to the measurement of the 
incidence of systems on the surface 3F. 

If we apply the operator W and the operator B = i(d/dt) to rj using (7.2) 
we obtain 

E = tr(#B), 

1711) 

(AE) 2 = tr (W(B - El) 2 ). 

From (7.7), (7.8) we obtain 

t = tr (WA), 

(7 12) 

(At) 2 = tr(W(A — t- 1) 2 ). 


On the basis of the commutation relation AB — BA = (l/Ql, from IV (8.3.17) 
we obtain (7.1). 

In order to avoid errors, we observe that the mapping defined by (7.4) does 
not establish an isomorphism between the Hilbert spaces if 2 (R 3 , d 3 k) and 
if 2 (E 3 , dt dx 2 dx 3 ). In order to derive relation (7.1) for ensembles of the form 
# (7.10) it suffices to prove that the map (7.4) is well defined. 

The relation (7.1) implies that it is only interesting to experimentally 
analyze the time incidence after the scattering, that is, to make experimental 
comparisons between the formulas (6.2.23) and (6.2.26)—if At in (7.12) (for 
W before scattering) is not greater than the theoretically expected difference 
in the time structure between (6.2.23) and (6.2.26). In particular, it does not 
make sense to prepare decaying states in the sense of XVII, §6.5 if, before the 
scattering At > t (where t is the decay time; see XVII, §6.5), that is, A£ < r _1 . 
On the contrary, we must choose AE $> r _1 . Experimentally, for the usual 
values of r, there is, of course, no difficulty, because it is experimentally 
difficult to make AE small. 

For the registration of scattering processes, not only impacts on surfaces 
but also completely different methods (such as cloud chambers, bubble 
chambers, etc.) are used, the effect procedures are not as simple as those 
described in §§6.1 and 6.2. The “evaluation” of such experiments is somewhat 
more difficult. Here by “evaluation” we mean nothing other than obtaining 
a good (more or less) determination of the effect operator F corresponding 
to the procedure. What theory should we use to carry out such an evaluation? 
With the help of quantum mechanics, or with classical theories which precede 
quantum mechanics? In order to obtain another step in the answer to this 
question, in the following chapter we shall investigate certain partial aspects 
of the physical processes of preparation and registration. For this purpose 
we shall require, as a fundamental assumption, the description of the 
scattering process presented in this chapter. 

Scattering theory raises another question concerning the possibilities of 
preparing and registering—a question which is of great importance for our 
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view of quantum mechanics. According to the description of scattering 
experiments presented in §1, it appears that it should be a simple matter to 
prepare experimentally an ensemble of composite systems in such a manner 
that it has the form (1.15) before the scattering, that is, it has the form 
W = x W 20 )Qt where Q_ is the wave operator introduced in §4. 

Our question is as follows: Is it possible to prepare the ensemble 
W = CQ-(W io x Here C is the time reversal operator intro¬ 

duced in X, §4. What is the structure of W? From CQ C 1 = Q + it follows 
that W = Q + [CW 10 C _1 x CH / ^ 0 C~ 1 ]Q+. W is therefore an ensemble 
which “after” the scattering is equal to CW\ 0 C~ > x CW 2U C^ K 

If, for example, W\ 0 , W‘ 20 describe a pair of colliding beams of systems 
1 and 2, respectively, then we must seek to “prepare” W such that, “after” 
scattering, two outgoing beams CW\ 0 C ~ 1 and CW 20 C ~ 1 arise. This would 
mean that, before the scattering, “infalling spherical waves” must be prepared 
which have two strongly correlated spherical waves of systems 1 and 2, 
respectively, in such a way that after the scattering two uncorrelated separated 
beams CW\ 0 C ' \ CW^C" 1 are created. To satisfy this problem will 
certainly be considered hopeless by any experimental physicist. Have we 
encountered a principle impossibility in the preparation process? 

If yes, then quantum mechanics would not be (at least for composite 
microsystems) a closed theory (g.G. closed theory in the sense of [8], §10.3). 
Then, we need to seek a more comprehensive theory which describes what 
really can be prepared. 

Where is the place where we have introduced requirements about the 
preparation possibilities which are too strong? 

It does not appear to be difficult to prepare the two diverging beams 
CW\ 0 C ~ 1 and CW^C -1 as described earlier (as illustrated in Figure 38). 
How does the preparation according to Figure 38 differ from the preparation 
of W by preparation apparatuses before the scattering? 

Let a % 6 2! be the preparation which corresponds to Figure 38, then 
(p(a^) = W where (p is given in III, D 1.3. In very good approximation 
(piaf) ~ CW 10 C~ 1 x CW' 20 C - 1 , since a l can only be combined with those 
registration methodes b 0 e (II, D 4.3.1) which register after the preparation, 
where we may replace W by CW\ Q C '~ 1 x CW l 20 C~ 1 . 

We seek a preparation procedure a 2 which is equivalent to a t (that is, 
< p(a 2 ) = <p(aj) = W) and which prepares before scattering, and may be 
combined with more registration methods b 0 than a v Our reason to doubt 
that such a possible a 2 exists has something to do with Axioms APS 5.1.3 and 
APS 5.1.4 from III, §1. If we examine the arguments of III, §1, which lead to 
the introduction of APS 5.1.4, we will find that our doubt about the existence 
of such an a 2 merely raises questions concerning that axiom. Then if b 0 is a 
registration method which cannot be combined with a, because b 0 already 
registers before a l prepares, then the equivalence class W to which a 2 
belongs does not necessarily contain ana 2 which may be combined with b 0 . 

The above problem is closely related to the question of the time translation 
(1,0,0, y) of registration procedures. If b 01 is a registration method which can 
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be combined with a u then for negative y there does not necessarily exist a 
a 2 e which can be combined with the translated registration method 
boi = ( 1 , 0 , 

The form of a more comprehensive theory than quantum mechanics is, 
as yet, unknown. In [7] there is a proposed solution for systems composed 
of “many” elementary systems. For the problem of smaller composite systems 
we can only refer readers to [7]. 
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CHAPTER XVII 


The Measurement Process and the 
Preparation Process 


In our terminology the above chapter heading probably should be called 
“Registration Procedures and Preparation Procedures” because it is con¬ 
cerned with a theoretical treatment of the phenomena which we have 
described in this book beginning with II, §4. Nevertheless we have used the 
expression “Measurement Process” because the literature commonly refers 
to the problems which are treated here as the “measurement process in 
quantum mechanics”. Actually, the usual quantum mechanics viewpoint is 
too one-sided, emphasizing the measurement process (in our terminology— 
the registration process). This onesidedness is related to the philosophical 
meaning and interpretation of quantum mechanics, where the measurement 
process is, of course, a central issue in the problem of meaning. 

Because of size limitations, in this book, we find it impossible to provide 
an extensive survey of the possible meanings of quantum mechanics and 
their corresponding viewpoints concerning the measurement process (see, 
for example, [24]). In order to remedy an underlying shortcoming in our 
presentation, we will now briefly describe the measurement process in terms 
which do not correspond to the principles described in this book. 

According to this viewpoint, the problem of measurement is that of 
measuring one or more unknown “properties of an object”, that is, determine 
whether or not the object under consideration has the given property. If the 
measurement is the “determination of a given property” then the objects 
must “have” these properties (at least at the time of measurement). Of course 
these properties can be changed shortly after the measurement by perturbing 
the object (by the measurement apparatus). By an ideal measurement process 
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we mean a measurement process in which a post-measurement perturbation 
of the measured quantity does not occur. In this viewpoint the projection 
operators are considered to mathematically characterize these properties. 

This viewpoint contradicts the principles underlying our formulation of 
quantum mechanics and its interpretation in II, §4. In our viewpoint the 
preparation and registration procedures are the basis for the interpretation 
of quantum mechanics and are not a means for the determination of the 
properties of micro-objects (registration procedures) or of the production of 
micro-objects having the desired properties (preparation procedures). On 
the contrary, we have seriously questioned whether it is possible to attribute 
objective properties to microsystems (III, §4 and IV, §8). Therefore the struc¬ 
ture of preparation and registration procedures is not a means by which we 
may experimentally confirm or refute statemehts about previously interpreted 
“micro-objects having properties”. Instead, concepts such as “object”, 
“property”, “pseudoproperty” can only be given meaning with the help of 
preparation and registration procedures. Therefore, in this connection, the 
expression “theory of the measurement process” does not mean to explain 
how we may measure properties, but how we may make a transition to a 
more comprehensive theory (in the sense of [8], §8) than that described earlier 
in this book. Here we would like to obtain a theory which will permit us to 
use information about the technical construction of the preparation apparatus 
and registration apparatus in order to determine the operators W = <p(a) 
and F = \j/(b 0 , b). The fact that we cannot do this in this book reflects the 
current state of affairs. At present we must “guess” the operators W and F, 
and then axiomatically assert them for certain procedures (see, for example, 
XI and XVI). The fundamental difficulty for a complete solution lies in the 
fact that we do not know how such a desired theory would be related to 
quantum mechanics. The first section of this chapter will provide a brief 
survey of these difficulties, the solution of which can be found in [7], 

1 The Problem of Consistency 

In I we have already described a difficulty in the problem of formulating an 
axiomatic basis of quantum mechanics. For microsystems nothing can be 
directly measured. A more precise formulation of the notions of direct and 
indirect measurements can be found in [8]. Here we shall only intuitively 
motivate the difference between these notions, without referring to [8]. 

By a direct measurement we mean the determination of facts without use 
of the 3P2T under consideration—here without the use of quantum mechanics. 
Of course this does not mean that we cannot use other theories (so-called 
pre-theories). Thus, with respect to quantum mechanics we may use, for 
example, classical electrodynamics as a pre-theory, in order, for example, to 
speak of a specific electric current. 

The so-called classical theories are characterized by the fact they are 
theories of “physical objects” (in the sense of III, §4.1 and [8], §12) whose 
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objective properties can be directly measured. The same is not true for the 
case of quantum mechanics. Indeed, the microsystems themselves are only 
indirectly determinable physically real facts. We have not taken the last fact 
into account in our formulation of an axiomatic basis from Chapter II 
onward, since we have introduced the set M of microsystems as our basis set. 
We have permitted this inconsistency in order not to make our representation 
more difficult. In [6], XVI we briefly outline and in [7] we provide more detail 
how this inconsistency can be alleviated. The measurement problem in 
quantum mechanics would not exist if it were possible to consider micro¬ 
systems and their properties to be directly measurable. 

By indirectly measurable we mean those facts which may be obtained with 
the aid of the theory under consideration and can be considered to be real 
physical facts, where the determination of indirectly measured facts are 
obtained on the basis of direct measurement and by making deductions 
within the framework of the theory. 

The questions concerning the indirectly determinable facts is not, however, 
a problem of measurement, as we shall see later in this chapter. The facts 
obtained from indirect measurements (on the basis of direct determinable 
preparation and registration procedures) has been the subject of Chapters 
III-XVI of this book, beginning with the introduction of ensembles, effects, 
decision effects in III, continuing with the discussion of observables and 
preparators, the objective properties and pseudoproperties in IV until the 
speciaj structure investigations of atoms, molecules and the scattering pro¬ 
cesses in XI to XVI. 

What more shall we expect from the discussions of this chapter? We shall 
develop a theory which permits the theoretical description of the interaction 
process between the preparation apparatus and the microsystems on the 
one hand and the interaction process between the registration apparatus and 
the microsystems on the other hand and therefore develop a more compre¬ 
hensive theory of the maps cp and ^ from III, D 1.3. But with the help of what 
theory? 

For obvious reasons we shall use the formulation of quantum mechanics 
developed in II-XVI, because this theory, according to VIII, permits us to 
combine systems to form new systems. Thus it is possible to combine micro¬ 
systems in order to form larger systems such as preparation and registration 
apparatuses. This means nothing other than introducing the apparatus 
themselves as elements of M. In this way we encounter the problem whether 
macrosystems themselves (such as preparation and registration apparatuses) 
can actually belong to the fundamental domain of quantum mechanics, as 
the concept of fundamental dqmain is introduced in [8] and [6], III. The 
fundamental domain is, however, the domain of experience upon which SPOT is 
applicable. This problem is not trivial. Adherents to the “universality” 
doctrine—those who believe that quantum mechanics is universally valid- 
claim that quantum mechanics remains valid for macrosystems, while 
sober-minded individuals claim that it is necessary to seek a theory of macro¬ 
systems which is more comprehensive than the quantum mechanics of many 
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particles. Such a theory of macrosystems should be compatible (in a yet to 
be described way) with the theory of many-particle quantum mechanics. 
Since we encounter the problem of the relationship between quantum 
mechanics extrapolated to many particles and a theory of macrosystems by 
asking questions concerning the measurement process, our goal of obtaining 
a deeper theoretical description of the preparation and registration process 
is very remote. An outline of the problem of the relationship between 
quantum mechanics and macrophysics can be found in [8], XV, XVI and 
[7], X, XI. We shall again return to the general problem of the compatibility 
of quantum mechanics (as developed in II-XIV) with a theory of macro¬ 
systems in XVIII. Here we shall restrict our considerations in order to attain 
a simpler goal. We will only show that quantum mechanics is self-consistent, 
that is, the description of the interactions of microsystems does not lead us 
out of the realm of quantum mechanics as described by the scheme presented 
in II. This “special consistency problem” will be discussed in the following 
sections. It will lead us to considerable progress in the direction towards a 
theory of the preparation and registration, not only in terms of epistemo¬ 
logical questions but also with respect to the actions of experimental 
physicists. 

An enrichment by additional structures of the theory described earlier (up 
to XVI) is made possible by the observation already made in I, §2 that it is 
possible to divide many experiments into preparation and registration 
portions in many different ways. In this way we obtain relationships (between 
the maps rp, if/ in III, D 1.3) which are determined in a purely quantum 
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mechanical way by the Hamiltonian operator. We will not examine the 
general problem of obtaining a mathematical description of the different 
possible ways of dividing an experiment into preparation and registration 
parts, but only present a few typical examples. For our first example we shall 
consider the scattering experiment described in XVI consisting of two 
preparation apparatuses and a registration apparatus. In XVI we have 
considered the two preparation parts (1) and (2) as a single composite 
preparation apparatus for scattering experiments (see Figure 41). We may 
also consider (1) to be the preparation part and both (2) and (3) to be a com¬ 
posite registration apparatus (see Figure 41). 

The special consistency problem consists of introducing by axioms 
additional structures for the different possibilities of dividing experiments 
into preparation and registration parts and that these axioms do not contra¬ 
dict previously introduced axioms (especially in II and III). The fact that this 
is not completely trivial we shall see in the following section when we try to 
strengthen the axioms in III. 

2 Measurement Scattering Processes 

In our terminology the expression “measurement scattering process” refers 
to the use of the scattering process for the purpose of registration. Such 
processes are not only of considerable importance for the clarification of the 
measurement process but are also frequently used in experimental physics. 
Before we formulate the general notion of measurement scattering processes 
we will first consider a simple example in which it appears. In order to make 
the example as simple as possible, we shall choose a “gedanken” experiment. 

2.1 Measurement with a Microscope 

Our “gedanken” experiment will consist of the determination of the position 
of an electron with the aid of a light quantum (photon), a microscope and a 
photographic plate (see Figure 42). The only imaginable possible (strongly 


X image plane 
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idealized) version of such an experiment requires that the electrons to be 
measured are confined to the object plane of a microscope, but they are free 
to move within the plane, that is, we are considering two-dimensional 
electrons. 

We wish to determine the statistics associated with the following experi¬ 
ment: A light quantum with an approximate experimentally determined 
momentum value p impinges onto the object plane from an approximate 
well-determined direction. It is then scattered by an electron, passes through 
the microscope and blackens a silver grain near X (see Figure 42) on the 
photographic plate which lies in the image plane of the microscope. The 
position X of the blackened silver grain will, of course, be different for 
different trials. Here we wish to make it clear that the statistics for the position 
X of the blackened silver grains permits us to determine the statistics for the 
position of the electrons in the ensemble IF which was used in the experiment. 

Now it is necessary that we experimentally fix a time t x (at least approxi¬ 
mately) at which we wish to measure the position—or more correctly—that 
we arrange the experimental system in such a way that it corresponds (at 
least approximately) to a measurement of Q(t x ). Here we may think of a 
situation in which the photons are generated by light flashes which illuminate 
the field of the microscope during a time interval f, — r/2 to t x + t/ 2. Here 
we note that r will become larger as the momentum p of the light quanta is 
more precisely defined. Here the velocity of light (in the units chosen here) is 
taken to be 1; according to the Heisenberg uncertainty relation the position 
uncertainty t of the photons corresponds to a momentum uncertainty 
A p > 1/t for the photons. 

In the <2(H) representation w(r) = <r| W\r) is the position probability 
density for the ensemble W of electrons. We now wish to establish the 
relationship between this probability density and the probability density for 
the blackened silver grains in the photographic plate. This is usually done as 
follows: In Figure 42 the dashed line from X of the blackened grain which 
passes through the center of the lens corresponds to a “measured” position 
x of the electron. This inference from X to x is a consequence of geometrical 
optics; every bundle of light rays emanating from x which passes through the 
lens crosses the photographic plate at X. This geometric construction 
establishes a correspondence between a surface area F in the image plane with 
a surface area F 0 in the object plane. For this reason we say that the occurrence 
of a blackened silver grain in F corresponds to the determination of a 
position measurement in F 0 . 

If b 0 denotes the registration method described above consisting of a light 
source (which produces the photons), a microscope and a photographic plate, 
and if b F denotes the registration procedure in which a silver grain is blackened 
in F, then we obtain i j/(b 0 , b F ) « E(F 0 ) where E(F 0 ) is the decision effect 
corresponding to the situation that the position (two-dimensional) of an 
electron at time t x lies in F 0 . 

Of course every experimental physicist knows that i p(b 0 , b F ) = E(F 0 ) 
cannot be an exact expression because the position of an electron cannot be 
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measured without a finite error associated with the finite resolution of the 
microscope, and because the scattering does not occur precisely at time 
but during the time interval fj — r/2 and t x + t/ 2. It is interesting that we 
speak of errors here even though an apparatus (as an object of nature) has 
a specific function and is not faulty. The expression “error” therefore cannot 
refer to the apparatus as such, but to the relationship between the apparatus 
and the idealization which we have in mind. We shall discuss this situation 
later. 

We shall now discuss this gedanken experiment in terms of the registra¬ 
tion procedures which were presented in III, §4.2 and in terms of the formula¬ 
tion of quantum mechanics presented in this book, especially those in XYI! 

We have already spoken of the blackening of a silver grain in the surface 
area F as a registration procedure b F . This expression already contains the 
following “idealization” which does not completely correspond to the 
experimental situation: A surface area F is a mathematical object, that is, an 
idealized concept; therefore, in an exact sense, there is no such “registration” 
in which a silver grain located in “F” is blackened. In order to characterize 
the actual registration process we must describe the evaluation procedure 
more precisely with the aid of a physical grid on the photographic plate. The 
blackened silver grains within each cell of the grid are then counted. Since 
such an evaluation procedure will, however, be of no significant value if the 
magnification of the microscope is so large that the imprecision due to lack 
of resolution is larger than the dimensions of the silver grain, we often use the 
idealization that the surface F p of the entire photographic plate is decomposed 
into surface area elements F. We have already considered such idealizations 
in IV where we have discussed the transition from the “real” Boolean ring 
ffl(b 0 ) to the abstract Boolean ring £ and its completion. In this sense the b F 
are elements of the “idealized” Boolean ring of surface elements F of the 
photographic plate. After we have clarified this step of idealization we shall 
make use of it in the following. 

If b 0 is the registration method (in the sense of II, §4.2) for the above 
gedanken experiment, then the b F are registration procedures for which 
b F czb 0 . Let F p denote the surface of the entire photographic plate. Clearly 
b Fp — b 0 is incorrect since, in an individual trial a single silver grain need not 
necessarily be blackened. Therefore b 0 \b Fp ^ 0. According to the axioms of 
quantum mechanics in III, to a surface area F of the photographic plate there 
corresponds an effect t j/(b 0 , b F ) e L. In the sense of the framework presented 
in II, III there is, therefore, no “error” in the registration method b 0 . There 
are, therefore, several physical reasons (for example, finite resolution) why 
il/(b 0 , b F ) # £(F 0 ) where F 0 is the surface area in the object plane corres¬ 
ponding to (according to geometrical optics) F. The wish (!) of the experi¬ 
mental physicist that i l/(b 0 , b F ) = £(F) is not satisfied by the registration 
method described above. Here the word “error” is therefore used by experi¬ 
mental physicists to describe the discrepancy between the actual registration 
method and the “desired” registration method. 

We now have obtained a very important result for quantum mechanics. 
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A measurement error for microsystems does not represent the difference 
between a “measured value” and a “real or intrinsic value” of the measured 
quantity. According to IV, §8 for microsystems it is impossible to attribute 
meaning to the expression “real or intrinsic value” because microsystems have 
no objective (that is, intrinsic) properties (with the exception of those 
associated with the elements of the center). The “error” associated with the 
finite resolution of the microscope is therefore not a difference between the 
measurement value x corresponding to the location of a blackened silver 
grain X and the “real” position x but is only definable as the difference 
between the actually measured effect il/(b 0 , b F ) and the desired effect E(F 0 ) 
of an idealized position measurement procedure. This difference may be 
computed, in principle, in the above example (and for many other real 
measurements) with the help of quantum mechanics alone. Thus quantum 
mechanics itself can provide important contributions for the measurement 
of microsystems. 

The possibility of using quantum mechanics for the above example 
depends upon the following structure of the registration methods. The 
measurement process described above may evidently be divided into three 
consecutive processes: 

1 The scattering of a photon on an electron, in which the presence of 
the microscope and photographic plate plays no role. 

2 The deflection of the photon after scattering into the lens (microscope 
objective). 

3 After passing through the microscope, this photon strikes the photo¬ 
graphic plate, and activates a silver bromide grain, where the latter 
turns black when the plate is developed. 

The process 1 corresponds to the scattering problem which we have examined 
in XVI. We have certainly not considered the scattering between photons 
and electrons there because we do not yet have a mathematically “clean” 
formulation of quantum electrodynamics. However, if we replace the optical 
microscope by an electron microscope and replace the electron to be measured 
by a proton, then we may consider process 1 as the scattering of an electron 
by a proton, a process which may be described by quantum mechanics. 

The process 2 may also be described by quantum mechanics if we replace 
the lenses by a refracting medium for photons. Again, this is more meaningful 
for the case of an electron microscope where the lenses are obtained by the 
use of magnetic fields. Here the deflection of electrons by a field can be 
described in terms of the methods described in VIII, §6. 

The process 3 cannot be described only in terms of quantum mechanical 
processes because the system encounters the macrosystems—the silver 
bromide crystals. Here we may, however, make use of the “effect of the 
microsystems striking a surface F” described in XVI, §6.1. If the sensitivity 
of the photographic plate was 100 percent, then the expression given in 
XVI (6.1.13) would be used (for 1 t -*■ — oo, t 2 -* +co) to describe the 
blackening of a silver grain in the surface F. Otherwise, it is necessary to 
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obtain an efficiency factor X where 0 < X < 1 which is to multiply expression 
XVI (6.1.13), in order to obtain the “effect of blackening” from the elfect of 
“striking the surface”. 

Our gedanken experiment clearly illustrates why we wish to refer to process 
(1) as a measurement scattering process. 


2.2 Measurement Scattering Morphisms 


Measurement scattering refer to the processes investigated in XVI but 
considered from a special viewpoint, namely, the viewpoint of their applica¬ 
tion to registration procedures. We now return to Figure 41. The apparatus 

(1) represents a preparation procedure a x which may be used to produce 
systems of type 1. We may therefore simply write: (p(a = W t where W y is an 
operator in the Hilbert space for systems of type 1. Similarly apparatus 

(2) in Figure 41 represents a preparation procedure a 2 which may be used in 
order to produce systems of type 2. We therefore write cp(a 2 ) = W 2 where W 2 
is an operator in the Hilbert space for systems of type 2. The registration 
apparatus (3) in Figure 41 will register only “after the scattering”. 

In the case of scattering experiments we have introduced a new structure 
in XVI, §1—a new preparation procedure a = y(a t , a 2 ) corresponding to the 
composite system. We now consider (and this is our new viewpoint) the 
apparatuses (2) and (3) together as a registration apparatus for the systems 
produced by (1) (see Figure 41). We will now express this viewpoint mathe¬ 
matically in terms of a new structure which identifies certain pairs (a 2 , b 0 ) 
and ( a 2 ,b ) (where a 2 eQ', b 0 e 0l ' o , b e ,3? and b cz b 0 ) with a registration 
method B 0 and a registration procedure B as follows: 


(a 2 , b 0 ) -> B 0 , 
(a 2 , b) -> B, 


( 2 . 2 . 1 ) 


what, for fixed a 2 , represents an isomorphic mapping of 8#(b 0 ) onto $(B 0 ). 
Of particular interest are the corresponding effects \I/(B 0 , B) if q>(a 2 ) and 
i l/(b 0 , b) are known. This problem can be solved by purely quantum mechani¬ 
cal methods. 

Here we shall only consider the case in which the scattering of systems 1 
onto systems 2 can be described with the aid of a scattering operator. This 
is the case when both system types 1 and 2 are elementary system types. 
Under these assumptions we have to use the ensemble 

W = S(W X x W 2 )S + (2.2.2) 


for the calculation of the registration probabilities for the apparatus (3) in 
Figure 41 (see, for example, XIV, (4.4.5b)) where W x = (p(a t ), W 2 = <p(a 2 ). 

According to the meaning of process 1 in the example in §2.1 we will 
discuss the case that the system 2 is used for the “measurement of system 1”, 
that is, after the collision only system 2 is registered. We therefore assume 
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that the registration apparatus (3) which is described by the registration 
method b 0 e 0& o registers effects of the form 1 x f 2 , that is, 

*Kb 0 ,b) = 1 x F 2 . . (2.2.3) 

From (2.2.2) and (2.2.3) we find that the probability that b responds is given by 

Ha nb 0 ,anb) = tr(S(Wi x W 2 )S + (l x F 2 )), (2.2.4) 

where a is the preparation procedure y(a u a 2 ). On the other hand, on the 
basis of the correspondence (2.2.1) we find that 

X{a nb 0 ,a nb) = A(aj n B 0 , a t n B) 

= tti(<P(ai)ip(b 0 , B)) = tr 1 (W r 1 F 1 ), (2.2.5) 

where i//(B 0 , B) = F i . The correspondence (2.2.1) is a structure which corres¬ 
ponds to a different division of the experiment into preparation and registra¬ 
tion parts. Therefore, it is consistent with the formulation of quantum 
mechanics presented earlier in this book (up to XVI) only if (2.2.5) is satisfied. 
On the basis of quantum mechanics this is the case since the right-hand side 
of (2.2.4) is, for fixed W t and F 2 , a bounded linear form over W t e and 
therefore over ,#(J^). Thus there exists a F t e M'(M \) such that 

trfSW x PV 2 )S + (1 x F 2 )) = tr 1 (FF 1 F 1 ). (2.2.6) 

Since the left-hand side of (2.2.6) takes on values between 0 and 1 for 
W t eK(J#l), it follows that F\ e L(M{), from which we have proven 
consistency. 

From (2.2.6) it is evident that, for fixed W 2 , a 3d continuous effect morphism 
L(M 2 ) -> L(Ml) is defined by 

T(1, 2; W 2 )F 2 = F 1 . (2.2.7) 

Here it is easy to discover examples for which T(l, 2; W 2 ) does not, in 
general, map the set of decision effects G{.ff 2 ) into the set of decision effects 
). Indeed, to the author’s knowledge, there is no realistic scattering 
operator S for which, for suitably chosen W 2 , the relation 

T(l, 2; W 2 )G(JF 2 ) cz G(^) 

is satisfied. If we introduced additional axioms in Chapter III which lead to a 
strengthening of the map i// in the form G, then (2.2.1) would no longer 
be consistent with the previously introduced axioms. The consistency 
problem (that is, freedom from contradiction) for further development of 
the theory arising from the addition of new structure is nontrivial. The 
consistency shown above shows us, conversely, that the axioms introduced 
in III were realistic, because the exclusion of the situation described by (2.2.1) 
would be unrealistic. Therefore it is particularly important that 1 /'(ho, b) e G 
does not hold in general. 

In order to make evident the fact that (2.2.7) describes the relationship 
between the maps J' K and & L and the correspondence (2.2.1) we 
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rewrite (2.2.7) in the form 

il/(5 0 , b ) = T(l, 2; (p(a 2 ))il/(bo, b). (2.2.8) 

The effect morphism defined by (2.2.6) will be called the measurement 
scattering morphism. The use of a scattering of systems of type 1 on systems 
of type 2 for the purpose of measurement (that is, registration) of systems of 
type 1 will therefore be completely described by a measurement scattering 
morphism T(l, 2, W 2 ). 

We have introduced the maps cp and t j/ in (2.2.8) for the purpose of clarifica¬ 
tion. Later we shall often describe different partitions of experiments into 
preparation and registration parts only with words, without providing maps 
in the form (2.2.1). We shall only give the corresponding formulas for the 
ensembles and effects. We hope that in these cases the physical meaning of 
these formulas will be clear, namely, the relation between the different 
preparation and registration procedures. In the example from §2.1 the 
apparatuses described by b 0 and a 2 are the “lens and photographic plate” 
and “light source”, respectively. The apparatus described by b 0 is therefore 
the “light source + lens + photographic plate”. The arrangement of the 
apparatuses (lens and photographic plate) characterized by b 0 —in particular, 
corresponding to the analysis of the processes 2 in §2.1—have not been 
considered here; this will be done in §3.1. 

2.3 Properties of Measurement Scattering Morphisms 

In XVI we have seen that it does not make sense to speak of a precise “time 
point” for the scattering. For this reason the “discontinuous nature” of the 
transition from W t x W 2 to S(Wj x W 2 )S + is not “physically real”, but 
instead is only a symbolic discontinuity which enables us to ignore the detailed 
evolution of the systems during the scattering process. In the Schrodinger 
picture we may, in principle (as we have seen in XVI, §4.4) attribute the jump 
fromU^ x W 2 to S(Wj x to “any” time. Again, this description makes 

evident the fact that there is no precise “time point” associated with the 
measurement in applications of the measurement scattering process. Indeed 
this is the case even if the time interval of the interaction is relatively short, 
we may not, in advance, consider this particular time t as the time at which 
an observable is measured, since the observable which we measure for system 
1 with the help of the scattering of system 2 will strongly depend on what we 
measure for system 2 after the scattering. 

If X 2 L(J(%) is an observable of system type 2, then, from (2.2.7) and from 
the diagram 


F-i j) 



f i L(M j) 
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we obtain an observable £ 2 L(tf 1). According to (2.2.7) the measurement 

of an observable £ 2 4 L(J^) on system 2 corresponds to the measurement 
of an observable £ 2 T(^i) on system 1. In particular it follows that 

coexistent effects for systems of type 2 will be coexistent effects for systems of 
type 1. This situation can be illustrated by the realization (see IV, §4) 
PA(h 0 ) 4 L(J^ 2 ) °f an observable £ 2 4 L(Jt 2 ) and the corresponding map 
4 T(.J^) since on account of (2.2.1), 3t(h 0 ) and &(b 0 ) are isomorphic. 
The reader may mind that noncoexistent effects F t can be obtained from 
coexistent effects jF 2 if we alter W 2 , that is, the preparation procedure a 2 ! 

The choice of what we may measure for systems of type 2 after the scattering 
on system 1 remains completely open. The measurement scattering process 
does not uniquely determine the observables of system 1 which can be 
measured by the use of the scattering process. At first this situation is 
irritating—because we wish to, with the aid of the measurement process, 
determine what is actually registered. The measurement scattering process 
itself cannot clarify this final registration because we have to assume that 
the final registration is already given by a registration method b 0 for system 2. 

The set T(l, 2; W 2 )L(J^ 2 ) is decisive for the possibility to make conclusions 
about systems 1 from the registration of systems 2. This set is a a compact 
subset of L(3/F 2 ). We therefore call the set T(l, 2; W 2 )L(J^ 2 ) the “set of 
information possibilities” for system 1 on the basis of the measurement 
scattering on systems 2 in the ensemble W 2 . 

The extreme case, in which “no” information possibilities may be obtained 
from the scattering experiment does not mean that T(l, 2; W 2 )L(J^ 2 ) will 
be empty, because T(l, 2; W 2 ) contains all F x of the form F x = 21 where 
0 < 2 < 1, a result which is easily shown by setting F 2 = 21. We shall call 
the case in which T(l, 2; W 2 )L(M 2 ) = {2110 < 2 < 1} the case of “no” 
information possibilities since effects of the form F x = 21 do not permit us 
to distinguish between the W x . 

The latter case occurs when, for example, there is no interaction between 
systems 1 and 2, that is, when S = 1. However, S = 1 is not necessary for 
this case, as we shall see from the “examples” following below. 

Of particular interest is the case in which 

T(l, 2; W 2 )L(3f 2 ) = co(F£) (2.3.1) 

holds for a particular observable £ 4 L(J^). Then we say that the information 
possibilities in scattering are restricted to the “measurement of the observable 
£ -> L(M[)y\ For the case in which 

T(l, 2; W 2 )L(^ 2 ) = L(^) (2.3.2) 

we say that by the scattering “no information possibilities are lost”. 

A systematic examination of the relation between S and the sets 
T(l, 2; W 2 )L(M' 2 ) has not yet been done. Here we shall only consider certain 
“principal” examples, without knowing whether they can be achieved for real 
interactions (that is, for realistic Hamiltonian operators). In these examples 
we shall set W 2 = where \ji e :W 2 . Furthermore, for a suitably chosen 
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complete orthonormal basis {<p v } in ^ we assume that 

S<p v $ = Zv»7v> (2.3.3) 

where y v e y\ v e^C 2 . Since the <p v \jj are pairwise orthogonal and S is 
unitary, the y v ^ v must be pairwise orthogonal: 


<Zv»?v, = <Zv, ^><»7v, = <v (2.3.4) 


Case 1. The y v are pairwise orthogonal. 

For ^ = P„ and x W 2 )S + = S(P„ x PJS + = SP^S + = P s ^ 

and <p = S v <p v a v it follows that 


trCSfW; x W 2 )S + (1 x F 2 )) = £ |a v | 2 <t 7 v , F 2 t/ V > 

V 

= triWFJ = £a v a/<jo v , F^). 
Since this holds for all vectors (p, that is, for all possible a v satisfying 


£KI 2 = l, 

v 

it follows that 


that is, 


<<Pv, F l9fl } = S vfl {ri v ,F 2 ri v '), 


F 1 ='Z<r, v ,F 2 r, v }P <Pv . (2.3.5) 

V 


The complete (atomic) Boolean subring X, of G(J^) generated by the P 9v is a 
maximal decision observable. We will show that the relationship 


T(l, 2; P^)L(J>F 2 ) c co(Z,) (2.3.6) 

follows directly from (2.3.5). 

Since F 2 e L(^f 2 ) it follows that 0 < (rj v , F 2 rj v ) < 1. If A v is a sequence of 
real numbers for which 0 < A v < 1, then in the a topology it follows that 


N 00 

I £ A v p„ v 

v=1 v=1 

since tr(WEv=i hP<p v ) tr(W £" =1 A V P^) holds for all We K, a result 
which is easily proven with the aid of tr(IF) = £ v (<Pv, = 1. In order 

to prove (2.3.6) we need only show that, for any finite sum 

£ A v P„ v e co(Zj) 

V = 1 

holds. For this purpose we shall renumber the finite sum in order that 
A, < A 2 < A 3 < • • - < Aj, holds. Then we may write 

I A v P Vv = Aj £ P„ v + (A 2 - Ai) £ P Vv 

v=1 v = l v = 2 

+ (A 3 — A 2 ) E P<p* + (Ajv — A w _i)P 9jv 

v = 3 

+ (1 — Ajv) 0. 
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Since 


Ai + (A 2 — Aj) + (A 3 — A 2 ) + • • • + (1 — A w ) — 1, 

it follows that £f = i A v P (?v is a convex combination of elements of E^ 

If the rj v are also pairwise orthogonal, then, for a given choice of P 2 we find 
that Z, c T(l, 2; P^,)L(^f 2 ) and we obtain 

T(l, 2, P^)L(^ 2 ) = co(Ej) (2.3.7) 

and we obtain an example for the case (2.3.1). We may also easily given an 
example of an observable for system 2 , for which the measurement of system 

2 is equivalent to the measurement of the decision observable Ej for system 1 . 
To this purpose we define E 2 to be the complete Boolean subring of G(Jf 2 ) 
which is generated by the P„ v . Then T(l, 2; P, ;/ ) is the isomorphic map of 
E 2 onto Ei for which T(l, 2;P^,)P, v = P, Pv . A “measurement of P, v ” on 
the system 2 therefore corresponds to a measurement of P (pv on system 1. 

If all the are identical, that is, r] v = rj then we obtain F 1 — A1 where 
A = <ti, F 2 r ,> and T(l, 2; P^)L(^) = {A110 < A < 1}. 

Case 2. The rj v are pairwise orthogonal. 

As in Case 1, for cp = E v <p v a v we obtain 

£ a v a M <<p v , F 1 (p^y = £ a v a/Xv, ^2^> 

V/i V/i 

and we therefore obtain 

<<Pv, F i<P„> = <Av, A„X>?v, F 2 ri v >. (2.3.8) 

For the special case in which all y v are identical (that is y v = y) it follows that 

<<Pv, 

and we therefore obtain <<p, P^) = P 2 f/> where >7 = £ v » 7 v a v . From 

<<p, P!(■/)) = <f/, F 2 rj) we therefore obtain (2.3.2). The case Scp^ = yj? v is 
therefore an example for which no information possibilities are lost. In 
particular, we may measure the observable E, c G(^f t ) described earlier in 
Case 1 on system 1 by measuring the observable E 2 defined above on system 2. 
In the same way we may, however, measure every observable on system 1 by 
measuring a corresponding observable on system 2 . 

3 Measurement Transformations 

Process 2 in the example from §2.1, namely, the reflection of a photon by a 
lens is an example of a measurement transformation. In general, by a measure¬ 
ment transformation we mean the application of interactions of other 
systems upon the system to be registered in order to register the system by 
“previously known” methods after the interaction. Measurement transfor¬ 
mations are used in great variety by experimental physicists. This topic will 
be discussed in great detail in § 6 . 2 . 
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3.1 Measurement Transformation Morphisms 

We shall return to the considerations of §2.2 with the single difference that 
we shall measure the system 1 after the scattering, that is, we replace (2.2.3) by 

4r(b 0 ,b) = F{xl. (3.1.1) 

Here we have introduced the index / for F t in order to indicate that this F{ 
is to be registered after the scattering. System 2 will therefore only be used 
as a registration tool, because, after the scattering we shall only register 
systems of type 1. As in §2.2, from 

tr(S(Wi x W 2 )S + (F{ x 1)) = trfW^i) (3.1.2) 

it follows that there exists a 'M continuous effect morphism T(l, 1; W 2 ) of 
L(Jf?j) into itself such that 

(K5 0 , b) = F[ = T(l, 1; W 2 )F{. (3.1.3) 

T(l, 1; W 2 ) is called a measurement transformation morphism. Here it is 
evident that the measurement transformations are very useful tools which 
permit us to obtain “new” effect procedures ( B 0 , b) from “previously known” 
effect procedures ( b 0 , b). 

Measurement transformation morphisms may lead to interesting results 
even in cases in which no information is transferred from systems 1 to systems 
2 (see §2.3). Such a special case occurs if we consider an external field as 
system 2 (see VIII, §6). Then the entire measurement transformation can be 
considered in terms of the Hilbert space of system 1. The registration 
method b 0 is obtained from b 0 by “switching on” the external field before b 0 
registers. 

Let S denote the scattering operator for the external field. In this case we 
replace (3.1.2) by 

tr 1 (SIT 1 S + F{) = tr,( Wi-Fi). (3.1.4) 

Thus it follows that 

F\ = S + F{S. (3.1.5) 

Therefore the measurement transformation morphism is defined by 

T(l, 1)F{ = S + F{S. (3.1.6) 

T(l, 1) is therefore an automorphism of L(M \) which is determined by the 
unitary transformation S. 

3.2 Properties of Measurement Transformation Morphisms 

From the considerations of §2.3 it simply follows that the measurement 
transformation morphisms have the same properties as the measurement 
scattering morphisms. They transform coexistent effects into coexistent 
effects and observables into observables. 
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Here it is important to perform a systematic analysis of the relationship 
between S and the set T(l, 1; W 2 )L(^\) which describes the change of 
information possibilities which takes place as the result of the measurement 
transformation. Since such an analysis has not yet been done, we will consider 
the same examples of operators S as we have demonstrated in §2.3, that is, 
we assume (2.3.3). 

In Case 1 (in which the y v are pairwise orthogonal), for W 2 — and 
W x = P <f) , from 

tr(S(W x x W 2 )S\F{ x 1))= 

V, fl 

= tr(W^Pi) = X ®v a ^<<Pv; F\(Pn> 

\,n 

we obtain the relation 

<<Pv, -Fi<P„> = <Zv, F (x„></7v, »/„>• (3-2.1) 

If the r] v are pairwise orthogonal, it follows that 

F\ = E <Zv, F{ Xv }P Vv . (3.2.2) 

V 

Thus, as in §2.3 it follows that 

T(l,l;P*)L(^) = cb(I‘), (3.2.3) 

where Si is identical to S x in §2.3. If we define E{ as the decision observable 
defined by the P Xv , then we find that in this case T(l, 1; P^) transforms the 
observable E{ into the observable 

If the are all equal, that is, t] v = rj, then from (3.2.1) it follows that 
<(Pv, F 1 ^) = <y v , Fix^y and we therefore obtain 

T(1,1;P*)L(^) = L(^). (3.2.4) 

from which we cannot conclude that T(l, 1; /v) is an automorphism of 
L(M X ) because the y v , for example, need not form a complete orthonormal 
basis in 

In Case 2 (the are pairwise orthogonal) from 
tr(S(P„ x P*)S + (P{ x 1 )) = E KI 2 <Xv, f!Zv> 

V 

= tr(P,/\) = E M/'Pv, F\(p fl ) 


it follows that 

<(Pv,F\ = <5 v# ,<y v ,T{Zv> 


and we therefore obtain 



= E<Zv,f{Xv>-P«, v - 

V 

(3.2.5) 

Thus it follows that 

T(l, 1; P^)L(^i) c: cb(Ei) 

(3.2.6) 
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with the previously defined The case in which the y v are pairwise ortho¬ 
gonal we have treated earlier. 

If all Xv are identical, that is, x v — /, then it follows that F\ — A1 where 
A = <x, F{x) and we obtain 

T(l, 1; P„)L(^) = {A1|0 < A < 1}. (3.2.7) 

For our examples, together with the results of §2.3 we obtain: If a scattering 
process is extremely unsuitable as a measurement scattering process, then we 
will lose no information possibilities from its use as a measurement transfor¬ 
mation. Conversely, if a scattering process is extremely unsuitable as a 
measurement transformation, then no information possibilities are lost for 
its use as a measurement scattering process. 

The special case of a scattering by an external field is extremely ill-suited 
for a measurement scattering process. In this case T(l, 1) is an automorphism 
of L(J^), and is therefore extremely well-suited as a measurement transfor¬ 
mation. 


4 Transpreparations 

We now return to the measurement situation described by Figure 41 in 
which we replace the composite apparatus consisting of the two apparatuses 
(1) and (2) by an arbitrary preparation apparatus (1,2) for systems composed 
of systems of type (1) and (2) (see Figure 43). In this way the Hilbert space ,9f 
of the composite systems can be written (without additional symmetrization) 
in the form x ^f 2 where F\ x 1 and lxf 2 are the effects which 



Figure 43 
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are produced by subsystems 1 and 2, respectively. We may consider the 
registration apparatus (3) in Figure 41 to be composed of two parts (3, 1) and 
(3, 2) (see Figure 43) where (3, 1) will respond only to systenls of type 1 and 
(3, 2) will only respond to systems of type 2. 

Let a denote the preparation procedure corresponding to the apparatus 
(1, 2). We therefore obtain (using somewhat simplified notation—see §2.2) 
(p(a) = W where W is an operator in / = x / 2 . The composite 
apparatus consisting of (3, 1) and (3, 2) corresponds to a registration method 
b 0 and a registration procedure b where i l/(b 0 , b) e L(M’). 

It is now necessary to develop an axiomatic formulation of the statement 
that apparatus (3) is obtained by combining the two apparatuses (3, 1) and 
(3, 2). Suppose we are given an &(b 01 ) and an 3#(b 02 ) such that 

HKi, hi)eL(^) 

and ily(b 02 , b 2 ) e L(. 2 f 2 ) where 0l(b ol ) and 0t(b O2 ) correspond to the appara¬ 
tuses (3,1) and (3, 2), respectively. 

Let Zi 2 be the free Boolean ring generated by the Boolean rings &(b 01 ) and 
&(b 02 ). This new Boolean ring can be easily constructed from the two Boolean 
rings 8&(b 0l ) and @(b 02 ) by constructing the appropriate “logical switching 
network”. We then require that there exists an isomorphic map h of Z 12 onto 
3#(b 0 ). The two rings !M(b 01 ), 0t(b O2 ) may then be identified with subrings of 
Z 12 , that is, 3l(b 01 ) may be identified with the ring of all b y a b 02 where a is 
defined in the abstract Boolean ring Z 12 . In this sense hb x and hb 2 are defined 
for all b x 6 &(b 0 ,) and b 2 e @l{b 02 ), respectively. We then require that 

\l/(b 0 , hb x n hb 2 ) = ij/(b ou b x ) x i//(b 02 , b 2 ). (4.1) 

The requirement (4.1) is equivalent to the somewhat more physically intuitive 
requirement 

tr(W x W 2 )iP(b 0 , hb t n hb 2 )) = tr^WMbou b x ))U(W 2 xl<{bo 2 , b 2 )) (4.2) 

for all W x , W 2 : the proof of this statement is left to the reader (see VIII, §2). 
Similarly, we suggest that the reader prove that ^(fe 0 , b) is well defined by 
(4.1) for all b e 3$(b 0 ). 

In the same way from two observables Z x L(:W\) and Z 2 ^ L(M 2 ) 
we obtain a uniquely determined observable Z 12 -4 L(#f x x . 2 ^) by taking 
the completion Z 12 (in the sense of IV, §1.4) of the free Boolean ring Z 12 
generated by L, and Z 2 using the following condition 

F(a j A a 2 ) = F^ffj) x F 2 (a 2 ) for a x e Z t , a 2 e Z 2 


which is analogous to (4.1). 

In the next sections 4.1 to 4.4 we shall examine the structure obtained by 
considering the apparatus (1, 2) together with the apparatus (3, 2) as a 
preparation apparatus of systems of type (1) (see Figure 43). 
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4.1 Reduction of a Preparation Procedure by Means 
of a Registration Procedure 

We may represent the preparation apparatus obtained by combining the 
apparatuses (1, 2) and (3, 2) (see Figure 43) by a mapping it of a subset of M 
into M, that is, by mappiii^ systems of type (1,2) (that is, pairs 1, 2) into 
systems of type 1 where kx is “partner 1” in the pair x. This formulation 
suggests that there is a mapping principle concerned with further experiments 
with partner 1. 

Here k maps the set a n hb 2 (using the above map h) onto a set 
a — k(a n hb 2 ). This map which assigns a set a to each pair (a, b 2 ) with 
b 2 e 8%(b 02 ) will, for simplicity, also be denoted by k. We now require, as an 
axiom, that 5 e J: 

(a,i> 2 ) 4dei>. (4.1.1) 

For fixed a this map is a homomorphism of the Boolean ring 8t(b 02 ) onto a 
Boolean subring of 3. for which the probabilities satisfy: 

X(k(a, b 2 ) n b 0l , k(a, b 2 ) n b t ) = A(a nhb 2 ,a n hb 2 n hby) (4.1.2) 

(where the latter is an additional axiomatic requirement for k !), since both 
sides experimentally yield the same probability where k(a, b 2 ) is considered 
to be the preparation procedure on the left-hand side and a is considered to 
be the preparation procedure on the right-hand side of (4.1.2). 

According to quantum mechanics from (4.1.2) it follows that (p(k(a , b 2 )) is 
known whenever il/(b 02 , b 2 ) and (p(a ) are known. In order to obtain the 
expression for (p(k(a, b 2 )) we begin with 

X(a nb 0 , a n hb t n hb 2 ) = A(a nb 0 , a n hb 2 )X(a n hb 2 , a n hb t n hb 2 ) 

from which it follows that 

tr((p(a)i^(h 0 , hbi n hb 2 )) = tr(<p(a)^(h 0 , hb 2 ))Ha n hb 2 , a nhb l n hb 2 ). 

Using the abbreviated notation (p(a) = W, i//(b 01 , b t ) = F u il/(b 02 , b 2 ) = F 2 , 
from (4.1) we obtain 

tr(PF(F! x F 2 )) = tr(JF(l x F 2 j)X(a n hb 2 , an hb 1 n hb 2 ). 

From (4.1.2) we finally obtain 

tr(W(l x F 2 ))A(k(a,b 2 ) n b 0l , k(a, b 2 ) n b,) = tr (W(F t x F 2 )). (4.1.3) 

According to (4.1.3) (as we shall shortly find) <p(k(a, b 2 )) is well defined 
providing that tr(JF(l x F 2 )) ^ 0. This result demonstrates the consistency 
of the new structure with the previously developed quantum mechanics. 
Here tr(IF(l x F 2 )) — 0 means that there are no systems of type 1 the partner 
2 of which will produce the effect t//(b 02 , b 2 ). In order to calculate (p(k(a, b 2 )) 
we rewrite the right-hand side of (4.1.3) as follows: 

tr (W(F t x F 2 )) = tr(IF(l x F 1 2 /2 )(F 1 x 1)(1 x F\ 12 )) 

= tr((l x Fl l2 )W(l x F 1 2 I2 )(F 1 x 1)) 

= tr^R^l x F\! 2 )W{ 1 x F^W,), 


Pidie. 
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where R l is the reduction operator onto subsystem 1 (see XVI, §1). Thus it 
follows that 


where 


A(fc(a, b 2 ) n b 01 , k(a, b 2 ) n b t ) = trj(<p(fc(a, b 2 ))F 1 ), 


cp(k(a, b 2 )) = 


JRi[(l x F\ l2 )W(l x F 1 2 l2 )1 
tr(IT(l x F 2 )) 


(4.1.4) 


Instead of the map (p(a ) -> cp(k(a , b 2 )) we shall consider the map 

(p{d) -> tr(IT(l x F 2 ))(p(k(a, bj), 

that is, the map S(l, F 2 ) defined by 

S(l, F 2 )W = R& 1 x Fl' 2 )W(l x Ti /2 )]. (4.1.5) 

K— 1 ~ F2) > K i is, as it is easy to show, an operation (see V, D 4.1.2). S(1;F 2 ) 
is also defined by means of the following identity: 

tr (W(Fi x F 2 J) = tr t ([S(l; F 2 )W~\F t ). (4.1.6) 

An operation measure x is defined by b 2 -4 S(l;il/(b 02 ,b 2 )) on the Boolean 
ring $(b 02 ) as follows: 

^ 02 )^ n (4.1.7) 


(where II is given in V, D 4.3.1), a result which follows from (4.1.6). By com¬ 
pletion of the Boolean ring tM(b 02 ) to a Boolean ring X 2 we obtain from (4.1.7) 
a transpreparator X 2 ^ n in the sense of V, D 4.3.3. 

This transpreparator, for fixed a, and therefore for fixed W = cp(a), defines 
a preparator X 2 K(^) (in the sense of IV, D 5.4) of R , W as follows 
(a el): 

ri(°) = X(°)W. (4.1.8) 

For the observable X 2 5 L(jF 2 ) corresponding to 3$(b 02 ) we therefore find 
that 


r,(a) = S(l, F 2 (a))W. (4.1.9) 

X 2 may be considered to be an idealized completion of the Boolean ring 
which is the image of 0t(b O2 ) under the map k, where I 2 K{M[) describes 
the preparator which corresponds to the formation of a preparation apparatus 
by means of putting the apparatuses (1, 2) and (3, 2) together. 

In our considerations we have chosen a fixed preparation procedure a. 
The apparatus (1, 2) itself may, however, correspond to a preparatory It is 
not difficult to extend the above considerations to this case. Let X 12 K(£F) 

where ^ x #f 2 be the preparator which corresponds to the apparatus 
(1, 2). Let X 2 -* L(M 2 ) be the observable corresponding to the apparatus 
(3, 2). Let X denote the free (complete) Boolean ring generated by X 12 and X 2 . 
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For a 12 e E 12 , a 2 e E 2 and therefore for rr 12 a a 2 e E the expression 

niP 12 A ^ 2 ) = S(l, F 2 (a 2 ))rj 12 (a 12 ) (4.1.10) 

defines an additive measure E 4 Ki'^) which represents a preparator of the 
ensemble R 1 ni 2 ( £ i 2 ) (where e 12 is the unit element of E 12 ). 

The “combination” of the preparator E 12 K(,yf ) with the observable 
E 2 4 L(.# 2 ) in the manner described above leads, therefore, to a new prep¬ 
arator E 4 K(3/fy) which corresponds to the combination of the apparatuses 
(1, 2) and (3, 2) to form a preparation apparatus for systems of type 1. 

All the above is consistent with quantum mechanics. However, this 
“combination” can be disturbing, as we shall find in §4.4. First we shall 
carefully study an often used special case of the above. 


4.2 Transpreparation by Means of Scattering 

We shall consider the special case of the situation described in §4.1 and illus¬ 
trated in Figure 43 in which the apparatus (1, 2) is a combination of two 
apparatuses (1) and (2) in the sense of a scattering experiment, as we have 
described in Figure 41. In this case we do not vary the ensemble produced 
by apparatus (2). The apparatuses (3, 1), (3, 2) will register only after the 
scattering. 

In the considerations of §4.1 we need only replace Why the special choice 
S(JF, x W 2 )S + . By analogy with (4.1.5) from: 

S(l, 2; W 2 ,F 2 )W 1 = R ,[(1 x F 1 2 l2 )S(W l x W 2 )S + ( 1 x F 2 /2 )] (4.2.1) 

we define an operation (for fixed W 2 ) as follows: 

K(.#i) S(U2:W2,F2) > KW). (4.2.2) 

If we consider the (idealized) observable E 2 5 L(,/f 2 ) corresponding to the 
apparatus (3.2), then we define a transpreparator 

S 2 4 n, (4.2.3) 

where 

yfix) = S(l, 2; W 2 ,F 2 (a)). (4.2.4) 

It describes the transpreparation of an ensemble W l with the aid of the 
scattering on systems 2 from the ensemble W 2 and with the help of the 
observable E 2 % L(M 2 ). 

For fixed W l the transpreparator defines a preparator 

e 2 4 toa 

where 

ni a ) = S(l, 2; W 2 , F 2 {a))W x , (4.2.5) 
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where 

r,(e) = R^SiW, x W 2 )S + ], 

Here we abbreviate 77 (e) = W{. 

This preparator leads to coexistent decompositions of W{ (see IV, §5). If, 
for example, e = a t is a partition of unity e of Z 2 (that is, cr ; a <jj = 0 for 
i # j) then 

W{ = X >l(^m (4.2.6) 

i 

is a decomposition of the W{. All such decompositions of W{ obtained from 
the same observable S 2 ^ L(J%) are coexistent decompositions (W 2 fixed). 
If we change the observable S 2 Tp^l), however, then we may obtain 
noncoexistent decompositions of the same ensemble W{. 

We will illustrate this situation by means of two examples. First, we shall 
consider the “theoretical” example of the scattering operator S defined by 
(2.3.3). Here we shall only consider the case in which both the r] v and the / v 
are pairwise orthogonal We then obtain (2.3.7). Therefore, since all effects 
T(l, 2; P^)L(J^ 2 ) are coexistent, we may make the conjecture that all 
possible decompositions with the aid of observables I 2 L(J^ 2 ) are co¬ 
existent. This is, however, not the case. 

In order to show this we consider, as an observable, S 2 L(M 2 ) the 
decision observable generated by the P 7v where y v e is a complete ortho¬ 
normal basis. In order to compute S(l, 2; P tl ,, P y ) for a y e , we set W t = P (f> 
where (p = <p v a v . Then for rj(P 7 ), according to (4.2.5) it follows that 

tr (S(P„ x P^S+iF, x P y )) = trMPJFJ. 

Since S(P,, x P lj/ )S + — P# where (f) = y v 77 v a v it follows that 

tr MPyWJ = <4>, (Ft x p^y 

= E“va/Zv»?v,(fl X Py)X^n) 

vn 

Z ^v^/XZv> ^iZ.X^v, 

V/i 

= Z M/Zv, ^iZ/zX^v, yXz> ^>- 


For 

<S = z a/y, 


it follows that 




77(P,) = |5><^| 


and therefore 

S(l, 2; P^, PyjPy = 1 *5 a ><<5 a 1, 

(4.2.7) 

where 

<5a = E <<7V 4»><y;i. 

(4.2.8) 
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From 


= 'ZXvWv 

V 

we easily obtain 

W f 1 = R 1 P 4 = Y l K<Py,<P>\ 2 P Xv . (4.2.9) 

V 

The decomposition of W{ after the registration of the P. n is therefore 
given by 

W{ = El^X^I = I ll^ll 2 P W || iA ||. (4.2.10) 

x x 

The decompositions (4.2.10) are not coexistent for arbitrary choices of the y v . 
We shall show this for an example: 


(P = — f= Oi + (Pi), 7i = Mi + Pi'll, 7i = Mi ~ Mi 

and additional vectors which are orthogonal to y u y 2 . Since 11>• j|| = 1, 
\p 1 \ 2 + \P 2 \ 2 — 1- Thus it follows that 


and <5 V = 0 for v > 2. 
Thus we obtain 


< 5 i = -7= (Mi + P2X2), 

V2 


$2 ~ /-(P 2 X 1 P 1 X 2 ) 

V2 


W{ = \P Xi + \P X2 (4.2.11) 

with the decomposition corresponding to the effects P yv P, n given by 
W{ = |5i><5i| + |5 2 )<^ 2 | 

= p iii II || +jPd2i\\d 2 \\- (4.2.12) 

(5i||<5 t [|“ 1 and S 2 1| S 2 1| _1 are two orthonormal vectors in the plane spanned 
by Xu 7 . 2 - The mathematically possible different decompositions (4.2.12) 
with “any two” orthogonal vectors in the plane spanned by 7.2 can also be 
“experimentally” produced by choice of the observables {P y J. If ^ x is oblique 
to Xu X 2 , then the decompositions (4.2.11) and (4.2.12) are NOT coexistent, 
as we have already seen in IV, §6. 

For our second example we shall use the gedanken experiment described 
in §2.1, which we shall again encounter in a different context in the next 
section. 

The operation S(l, 2; W 2 , F 2 ) defined in (4.2.1) must be closely related 
with the measurement scattering morphism described in (2.2.7). From (4.1.6) 
it follows that, for the special case in which W = £(14) x W 2 )S + 

trtfW x W 2 )S + (F 1 x F 2 J) = trj([S(l, 2; W 2 F 2 )W 1 ]F 1 ). (4.2.13) 
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Using the mapping S'(l, 2; W 2 ,F 2 ) which is adjoint to S(l, 2; W 2 , F 2 ) it 
follows that 

tr(SW x W 2 )S + (F 1 x F 2 )) = triCWiCS'Cl, 2; W 2 , F 2 )F^). (4.2.14) 

For the special case in which F 1 = 1, if we compare (4.2.14) with (2.2.6) and 
(2.2.7) we obtain the relation: 

T(l, 2; W 2 )F 2 = S'(l, 2; W 2 , F 2 ) 1. (4.2.15) 

The observable corresponding to the transpreparator (4.2.4) (the associated 
observable in the sense of V, D 4.3.4) is therefore given by Z 2 L(M\) (where 
Ffa) = T(l, 2; W 2 )F 2 (<t)) and was introduced in the diagram in §2.3. The 
effect transformer which corresponds to the transpreparator (4.2.4) in the 
sense of V, D 4.3.4 is given by 

z4p, 

where 

X '(a) = S(l, 2 ;W 2 , Ffa)). 

Similarly, as described near the end of §4.1, these considerations can be 
extended to the case in which the apparatus (1) is not described by a single 
preparation procedure a x nor by a single ensemble W x but by a preparator 
Ej We shall now require that W 2 be fixed! Let Z be the free (com¬ 

plete) Boolean ring generated by E t and E^Foroq eE u a 2 eZ 2 and therefore 
a t a cr 2 e Z we define an additive measure Z \ R(M[) by means of 

nip 1 n a 2 ) = S(l, 2; W 2 , F 2 (<x 2 )>/ 1 (<t 1 ) (4.2.16) 

which is a preparator of the ensemble 

Ki[%i(u) x W 2 )S + l (4.2.17) 

where corresponds to the unit element of Z t . This preparator has the 
following meaning: It is the preparator for systems of type 1 which are 
obtained from the preparator Zj 2) K(M\) with the help of a scattering on 
systems of type 2 and of a further decomposition according to the responses 
of the a e Z 2 (where a is the idealized registration for systems 2). 

4.3 Collapse of Wave Packets? 

In this section we shall consider a concept which is widely found in the litera¬ 
ture of the measurement process in quantum mechanics and has led to many 
mystical ideas in quantum mechanics—the notion of the collapse (or 
reduction) of wave packets. Indeed, this notion has lead to apparent problems 
and to various attempts to solve these apparent problems. The mystification 
arose from a misunderstanding of J. von Neumann’s description of the 
measurement process presented in his famous book Mathematical Founda¬ 
tions of Quantum Mechanics [25], In that classic work J. von Neumann con¬ 
sidered the time dependence of a system 1 under the unitary transformations 
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U, as described in X. In addition he considered measurement processes 
in which a W x e is transformed by the measurement process into 

another W\. J. von Neumann obtained the transition from W t to W\ by 
making certain assumptions about the measurement process which we 
shall consider in more detail in §5. On the basis of this abstract axiomatic 
form it is possible to arrive at the misunderstanding mentioned above in 
which the transition W t -> W\ is not only considered to be a mathematical 
discontinuity but is also considered to be a physical one. The mystifying 
notions arise from attributing physical reality to the “jump” at a given time t. 

J. von Neumann has shown that the two time dependent processes are 
essentially different. The fact that they are different is clear on the basis of 
the representation and the interpretation of the two processes that we have 
given in VII, VIII, X and in this chapter. 

The processes described by U, describe the “free” system in the sense 
described in VII, namely, in the sense that the registration apparatus is 
displaced by a time interval t. The “free” propagation of systems 1 between the 
preparation and registration apparatus is extended by a time interval of 
length t. 

By a change from W l to W\ which occurs as the result of the measurement 
process we mean the change of the ensemble W t before the application of the 
registration procedure to W\ after the application of the registration pro¬ 
cedure which results from interaction with the registration apparatus. But 
what change did J. von Neumann intend in his axiomatic formulation? 

Disregarding the special form of the J. von Neumann postulate (which we 
shall consider in detail in §5), the transition from W t to W\ is nothing other 
than the process described in §4.2 with the aid of the transpreparators (4.2.3) 
and (4.2.4) as follows: 

W 1 - W\ = = AS(1, 2; W 2 , F 2 (k))W u (4.3.1) 

where A is a normalization factor, so that tr(IVj) = 1. Thus W j is obtained 
by decomposition according to the response of a. 

We shall now describe how it was possible to imagine (within the context 
of the decompositions (4.3.1)) a collapse of the wave packet by using the 
gedanken experiment described in §2.1. In this experiment we find a measure¬ 
ment scattering process between the light quanta and the electrons to be 
measured. Then the light quanta are registered with the aid of a microscope 
and photographic plate. Suppose that the ensemble of electrons prepared 
before the scattering W t is one of the P rp where q> has extremely precise 
momentum (in the object plane), then in the position representation 
<Xj, x 2 1 <p> is a very broad function in the object plane. 

Without examining the implications of “additional concepts” in quantum 
mechanics, many people consider the “wave packet” (x 1; x 2 \ (p) to represent 
the state of a single electron. Providing that the scattered light quantum is 
registered upon the photographic plate, we may (approximately) infer the 
location of the electron from the position of the blackened silver grain on 
the photographic plate. Suppose, for example, the position of the measured 





328 XVII The Measurement Process and the Preparation Process 

electron is approximately xf\ x ( 2 \ The following conjecture may appear to 
be reasonable: After the measurement not all positions associated with the 
wave packet (x 1 ,x 2 \cp} are possible but only positions in the vicinity of 
xf\ xf\ that is, “the wave packet <x t , x 2 1 <p) has, as a result of the measure¬ 
ment process, undergone a ‘collapse’ to a wave packet <x t , x 2 |^> which is 
strongly concentrated in the vicinity of xf\ x ( 2 0> .” 

The somewhat mystifying mode of description outlined above is a result 
of the addition of unnecessary preconceived notions to quantum mechanics. 
This will become more evident when we consider the following modification 
of the gedanken experiment described in Figure 44. The modification consists 
of locating the photographic plate in the focal plane instead of in the image 
plane. From the location of the blackened points on the photographic plate 
it is possible to infer the propagation direction of the scattered light quantum. 
If we choose the ensemble W 2 of the light quanta before scattering to be one 
for which the momentum is precisely fixed, and assume that the absolute 
value of the momentum of the light quanta is not changed by the scattering, 
then we may infer the momentum change of the electron from the direction 
of the scattered light quantum (since the momentum of the electron before 
the scattering is fixed). In this modified experiment the wave packet 
<x l5 x 2 | <p) does not collapse in space but undergoes a sudden transition 
into another wave packet (x 1 ,x 2 \i//) which has fixed momentum and 
therefore has a large spread in position x u x 2 . 

The form of the new wave packet which results from the “collapse” of the 
wave packet (x t ,x 2 \ <p> depends strongly upon where the photographic 
plate is placed. In principle it is possible to imagine a microscope which is so 
large that we have several hours (after the scattering) to decide where to place 
the photographic plate. Here the blackening of the photographic plate which 
happens several hours after the scattering will decide the manner in which 
the “jump” of the wave packet has occurred (several hours before the 
blackening!). This “decision” is, in part, also a decision of our free will! 

It is not surprising that the range of explanations for this imagined collapse 
of the wave packet is very broad, from the modification of the Schrodinger 
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equation to the interpretation of the wave packet <x t , x 2 \ <p> as a mathe¬ 
matical description of our “knowledge about the electron under considera¬ 
tion”. We do not need to examine all these attempts, because, according to 
the interpretation presented in II, the entire problem of the “collapse of the 
wave packet” is not a real problem but is a problem which arises from the 
addition of unnecessary ideas. 

According to our formulation of quantum mechanics nothing is dis¬ 
continuous. In particular, the above gedanken experiment may be described 
as follows: Electrons 1 of the ensemble W l = P v are scattered by light 
quanta 2 from ensemble W 2 , whereupon after the scattering the ensemble is 
described by S(P v x W 2 )S + where S is the scattering operator. The “jump 
from P v x W 2 before the scattering to S(P V x W 2 )S + after the scattering is 
only a computational rule which permits us to avoid the need to provide a 
detailed description of the behavior of the system during the scattering. The 
investigations presented in XVI associated with the introduction of the 
scattering operator S have made this clear. A true sudden “jump” does not 
occur. 

The scattered light quanta 2 can be registered by means of a microscope 
and photographic plate, the results of which are described by the effects F 2 . 
If X 2 is, as described in §2.1, the Boolean ring of surface elements in the image 
plane, then we may decompose the ensemble W{ = Ki[S(F v x IV 2 )S + ] for 
the electrons with the aid of the preparator X 2 -A K(M{) where 

r,(a) = S(l, 2; W 2 , F 2 (a)P, p 

according to which surface elements a of the photographic plate have been 
blackened. By choosing a “small” a then rj(a) is an ensemble with negligible 
dispersion of the position. 

We may also consider the Boolean ring H 2 of the surface elements of the 
focal plane; then, from the observable X 2 -+ L(M 2 ) we obtain a preparator 
Z' 2 4 where 

V'(o) = S(l, 2; W 2 , F’ 2 (o))P, p 

permits another decomposition of the same ensemble W{. If a e £ 2 is a small 
<t then ri'(rr) is an ensemble which has small dispersion in momentum (pro¬ 
viding that P rp and W 2 have small dispersion in momentum). 

The two preparators E 2 K(M[) and X 2 are, of course, not 

coexistent! Therefore the corresponding decompositions of W{ are not 
coexistent. The transpreparators 

£ S(l, 2 ;W 2 ,F 2 (---)) S(l,2; W 2 , Fii"')) j-j 

are therefore not coexistent. Noncoexistent transpreparators are, however, 
typical for quantum mechanics, as we have already seen in IV, §§5 and 6. 

The process of decomposition by means of preparators, as described in the 
form of Boolean rings Q(a) which has already been introduced in II will 
therefore be mystified by such modes of description as the collapse of wave 
packets. The possibility of two noncoexistent decompositions of the same 
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ensemble W{ with the aid of a photographic plate in either the image plane 
or focal plane is therefore only an example of the model described in §4.2 with 
the decomposition possibilities given by (4.2.8) to (4.2.10). 

Nevertheless, we should not conceal the fact that the fundamental quantum 
fact: the existence of noncoexistent (indeed, complementary, see IV, D 6.4) 
preparators raises a peculiar problem, which, psychologically, is difficult to 
avoid. This problem is known as the “Einstein-Podolski-Rosen Paradox”, 
although it is not a real paradox, that is, it is not a contradiction in quantum 
mechanics. We shall consider this “paradox” in the following section. 


4.4 The Einstein-Podolski-Rosen Paradox 

The starting point for our discussion of the so-called Einstein-Podolski- 
Rosen (EPR) paradox is the discussion in §4.1 (and the somewhat more 
specialized discussion in §4.2). 

For an observable Z 2 ^ L(J^ 2 ) we obtain a transpreparator 

X 2 4n, (4.4.1) 

where 

X(ct) = S(l, F 2 (a)). (4.4.2) 

S(l, F 2 ) is defined by (4.1.5) or (4.1.6). For an arbitrary W e x -W; 2 ) 
from the above transpreparator we obtain a preparator 

Z 2 4 K(.^), (4.4.3) 

where 

r/((r) = i(o)W (4.4.4) 

for the ensemble W x = R X W. The preparator (4.4.3) permits coexistent 

decompositions of W u that is, according to a decomposition a t of the unit 
£ of E 2 as follows: 

W x = rj(e)W = £ ri(Oi)W. (4.4.5) 

i 

Everything depends on the observable H 2 L(Jf 2 ). There are now many 
experiments by which we may measure the observable Z 2 L(jF 2 ) for 
systems of type 2 providing that there is no longer any interaction between 
systems of type 1 and type 2 for the ensemble W. Examples of such measure¬ 
ments are the measurements after scattering of systems 1 onto systems 2 
which we have discussed in §4.2. 

Since there is no longer any “interaction” between systems 1 and systems 2, 
the measurement may no longer change the ensemble of systems 1, that is, it 
should not make a difference whether systems 2 are measured (and what 
observables are measured for systems 2), that is, W x = R x IFis not changed by 
making an additional measurement (such as scattering systems 2 on systems 
3—see §6.1). This description is completely consistent with quantum 
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mechanics. However, with the aid of different observables Z 2 ^4 L(J^) we 
may obtain different preparators (4.4.3) and therefore obtain different 
decompositions of the same ensemble W t . The discussions in §4.2 (4.2.8)- 
(4.2.10) provide an example. 

In order to illustrate the so-called EPR paradox we shall construct 
examples of two observables L(J^) and E ( 2 2) L(M 2 ) and we shall 

choose an ensemble W such that the corresponding preparators Z ( 2 ) ’’-V 
and Z ( 2 2) &(■#[) of the same ensemble W t = P, IT are complementary in 

the sense of IV, D 6.4. 

We shall now describe a simple example. Let the systems 1 and 2 be elemen¬ 
tary systems of spin Let the apparatus denoted by (1, 2) in Figure 43 prepare 
an ensemble W = (p(a) e K(M{ x j/P 2 ) for which the “total spin” is zero, that 
is, W satisfies the relation tr(ITP) = 1 where P is the following projection 
operator in x t 


P = 1 x Pf where </> 

(where = j% b x « 1 ,Jf 2 = ^zb 
therefore has the form 


1 

72 

x 1 


[« + (l)M2)-M2)Ml)]6i 

2! = X ^2 bi i = *1 x 


i 2 )- 


w 


W = W b x Pj,, (4.4.6) 

where W b is an operator in . 

We may choose the apparatus (1, 2) such that (see Figure 45) the systems 
1 and 2 leave in completely different directions and therefore after the 
preparation are no longer in interaction. 

Let the apparatus (3, 2) measure the observable Z 2 L(M 2 ). This observ¬ 
able will be the decision observable “the measurement of spin in a given 
direction n”, that is, we can choose the Boolean ring I-,' 0 which is generated 
by the operators P„ + and P„ in t 2 where u + , u_ e t 2 are eigenvectors of the 
spin component of systems 2 in direction n. Z ( 2 > will therefore consist of the 
elements 1, P„ + , P„ , Q (as operators in i 2 ). 

Then, from (4.4.2), (4.4.3) and (4.1.5), after a simple computation, we obtain: 

ri(P u+ ) = lW lb x P„_ (4.4.7) 
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as an operator in 3/e Lb x ty, where W lb denotes the restriction R , W h of W b in 
3*P b = 3tfy b x J4? 2b . In the same way we obtain 

¥Pu.) = jW lb x P u+ . (4.4.8) 

Therefore, with the aid of the observable E ( 2 n) L(J%) we obtain the de¬ 

composition of the ensemble 

Wy = R 1 W=jW lb x 1 (4.4.9) 

in the form 

Wy = ±W lb x P u+ + ±W lb x P„_. (4.4.10) 

Here we recommend that the reader verify how the decomposition (4.4.9) 
of the ensemble Wy (prepared by the apparatus which consists of the appara¬ 
tuses (1, 2) and (3, 2)—see Figure 45) is achieved in terms of the responses of 
the apparatus (3, 2). The apparatus (3, 1) serves only to test whether our 
theory is correct. If, for example, we use (3, 1) to measure spin in the n 
direction on the component jW lb x P u we obtain the value (— 2 ) with 
certainty since 1 x P„ has no dispersion in jW lb x P u _. The response 
(+£) of the apparatus (3, 2) is therefore strongly correlated with the response 
(—|) of the apparatus (3, 2). The fact that it is possible to produce an ensemble 
jW lb x P„_ in the manner described in Figure 45 with the aid of the com¬ 
posite apparatus (1, 2) and (3, 2) is certainly not alarming. On the other 
hand, it is alarming to find that we may, without changing the apparatus 
(1, 2), by only rotating the apparatus (3, 2), obtain another preparator which 
is complementary to the original! 

Choose, for example, first n in the direction of axis 1 of the laboratory 
system, and second, in the direction of axis 2; we then obtain two observables 
S ( 2 1) ^> LC#S) and T 2 2) ‘V L(,^) which lead to two preparators of the form 
given in (4.4.7) and (4.4.8): 

ri a) (P u+ ) = iW lb x P u _, V 1 \P U ) = jWn x P u+ (4.4.11) 

and 

ri (2 \P v J = iW lb x P v _, r, i2) (P v .) = tW lb x P v+ , (4.4.12) 

where u+ , are the eigenvectors of spin in the 1-direction and v +, V- are 
the eigenvectors of the spin in the 2-direction. 

We now show that from 0 < W < W lb x P u we obtain the following 
relation W = W' x P u „. For 0 < W < W lb x P u _ and 0 < W < W lb x P„_ 
then it follows that W = W' x P„_ = W" x P v _ and therefore W = 0 
since tr((W' x P„ )(l x P„J) = 0 and tr((PF" x P„J(1 x P U J) = tx{W") 
tr(P„_P„ + )) = tr(IF")|<^_ |u + >| 2 and |<u_ |m+)| 2 ^ 0. According to IV, 
Th. 6.1 the two decompositions 

Wy = jW lb x 1 = v (1) (P u J + ri w (P u J 
and 

Wy = n {2 \p v+ ) + n a) (P v J 
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are complementary, and, according to IV, D 6.5 the two preparators 
Z'/ 1 4’ K(.yP x ) and Z' 2 2) '4 £p^) are complementary 

Since this is a very illustrative example for two complementary preparators, 
we will now describe this experiment in detail more precisely with the aid of 
preparation and registration procedures. Let a denote the preparation 
procedure associated with the apparatus (1, 2). We therefore have<p(a) = W 
where W is given by (4.4.6). Let bo\ h ( 0 2) denote the registration methods 
defined by the apparatus (3, 2) for the registration of spin in the 1 and 2 
directions, respectively. &(b ( Q >) consists of the sets h^ 11 , b ( l\ b ( l \ 0 where 
ik(b ( o\ b ( + ] ) = 1 x (1 x P u+ ) and i A(h < 0 1) , b n J) = 1 x (1 x P u ) with respect 
to the representation x x t 2 ) of the Hilbert space :W. A similar 

situation holds for ^(b^). 

According to (4.1.1) there exists a k given by 

(a, l #>) 4 a (1) , 

(a, b 4) 4 a ( +>, 

(a, ’) 4 a”', 

where ffl(b { Q } ) is isomorphically mapped onto J(a (1) ). It follows that 
<p(a (ly ) = W t = jW bl x leK(Jta 
<P(a+ } ) = W lb x P u _, 

<P(Z (U ) = W lb x P u+ . 

From a n> = a*)* u a ( i’ and a (1) n a ( i ( = 0we obtain the following decom¬ 
position (see IV, §5) 

<p(a a) ) = i<K3+ 1) ) + j(P(a ( - ] ). 

If we replace (1) by (2) we obtain the analogous formulas by replacing 
«+, by v + ,v_. 

Thus we have obtained two different preparation procedures a a) and a <2) 
where <p(a (l) ) = (p(a (2) ) = • a (1) is obtained by combining a with and 

a (2> by combining a with b ( 0 2 \ 

J(a (1) ) 4> 4 I k(.^fx) and l(a {2) ) ’’4’ ^(jfi) where <p a o> and (p 3 m are the maps 
introduced in §5 therefore represent two complementary preparators. Since 
they are complementary, according to IV, D 6.4 we must obtain 

s w n #2) = (4.4.13) 

Therefore (4.4.13) is a consequence of the quantum mechanical axioms. Thus 
we find that this is a nice example of the structures described more generally 
in IV, D 6.4. 

The same can be done for the example described in §4.3 of the decom¬ 
position of an ensemble W{ of electrons after the scattering with the aid of 
the registration points on the photographic plate. With the aid of a photo¬ 
graphic plate in the image plane we may decompose W{ into ensembles which 
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have little dispersion in position. The same ensemble may also be decom¬ 
posed, with the aid of a photographic plate in the focal plane into ensembles 
which have little dispersion in momentum. Since, according to the Heisen¬ 
berg uncertainty relation (see IV (8.3.16)) there are no ensembles which have 
little dispersion in both position and momentum, we therefore obtain two 
complementary decompositions of W{ without influencing the scattered 
electrons, namely, by placing the plates in the image and focal planes. For 
both these preparations of electrons d (1) (the photographic plate in the image 
plane) and a (2) (the photographic plate in the focal plane) we obtain 
<jo(d <1) ) = (p{a m ) = W{; Despite this, (4.4.13) must be satisfied! 

In this way we obtain a contradiction-free explanation of the EPR 
“problem” in quantum mechanics. We obtain EPR paradox from the EPR 
problem when we make (apparently) “plausible” conclusions from the 
observed phenomena. The apparent “plausibility” results when we believe 
that objective content is being transmitted through space by the micro¬ 
systems (even though the latter are not “tiny spheres”). 

The fact that it is possible to decompose ensembles in a complementary 
way without making use of the interactions with the microsystems of the 
ensemble can only rest upon the fact (or at least a widely held opinion) that 
the microsystems are already distinguishable with respect to a “property” 
that they will either respond or not respond when they are subjected to 
an effect procedure ( b 0 , b) if *J/(b 0 , b) is a decision effect, that is, if ij/(b 0 ,b)eG. 
Otherwise, how would it be possible, with the aid of a particular decom¬ 
position procedure, to select, without interaction, those microsystems which 
respond with certainty to ( b 0 , b)l 

These considerations make it plausible to claim the existence of a set S h 
satisfying the axioms AH 1 and AH 2 which have been introduced in IV, §8.3. 
Since we have to introduce AH 1 and AH 2 as axioms, it is clear that our 
“plausible” arguments do not represent deductions made from the EPR 
phenomena. The mathematical form of a theory cannot be outwitted by 
pretending that AH 1 and AH 2 are theorems. If we add AH 1 and AH 2 as 
axioms then we obtain a theory which is contradictory and is in this sense a 
paradox. The consequence has to be that the so-called “plausibility” 
arguments (leading to AH 1 and AH 2) are false! 

5 Measurements of the First Kind 

In the historical development of quantum mechanics a great role has been 
played by “idealized measurements” which are defined by formal axioms. 
Such axioms can be found in the first edition of this book (see [26], II, §3). In 
this edition such axioms are not needed for the fundamentals presented in II 
and III. However, since they have played such an important role, we shall 
now consider these “idealized measurements” in order to consider the dubious 
existence of such measurements. We cannot derive the existence of such 
“idealized measurements” from the axioms we have used for the development 
of quantum mechanics. 
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We now return to the discussion in §§2 and 4. From T( 1, 2; W 2 ) we obtain 
a new observable £ 2 L(M'\) from the observable S 2 E(J>F 2 ) as follows 

Ei(a) = T(l, 2; W 2 )F 2 (cr). (5.1) 

In addition, we obtain the transpreparator £ 2 4> n as follows: 

X (a) = S(l, 2; W 2 , F 2 («t)). (5.2) 

We define a measurement of the first kind as a measurement scattering 
together with an observable I 2 ^ L(.# 2 ) such that the relation 

t r i(Dc( ff )Wi](l - F,(a)) = 0 (5.3a) 

is satisfied for all a e Z 2 and all e K(M[). Equation (5.3a) is equivalent to 

tr(S(fF x x W 2 )S + ((1 - F^a)) x F 2 (<t)) = 0. (5.3b) 

The requirement (5.3a) for a measurement of the first kind has the following 
interpretation: After the measurement all those systems of type 1 which are 
sorted according to the “response of <r” will correspond to an ensemble in 
which F i(cr) will respond with certainty providing the observable Z 2 L(JF t ) 
is measured again. 

Measurements which are not measurements of the first kind, that is, for 
which (5.3) is not satisfied for all ueEj are often called measurements of the 
second kind. If E 2 -> G(J^), we then speak of measurements of the first kind 
for a decision observable. Since in the first decades of the development of 
quantum mechanics physicists have only considered decision observables, 
in the literature the use of the expression “measurements of the first kind” will, 
in the terminology used here, always refer to measurements of the first 
kind for a decision observable. 

The condition (5.3a) is identical to the condition IV (5.7) for the preparator 
W x (a) = x(a)Wt since F,(s + a) = 1 — F x {a). Condition (5.3a) therefore 
has, as a consequence, the fact that the preparator x( a )W u which is associated 
with the measurement of X 2 5 L(M 2 ), for each W 2 is a dispersion-free 
preparator with respect to the measured observable £ 2 -i L(^). According 
to IV, §5 it follows that (5.3a) can be satisfied only if, according to IV (5.12) 
where Ej/F is the support of W{ = x( £ )^i the measure F { (a) has the form 

F,{a) = E { {o) + (1 - E)F i (n)(l - E), (5.4) 

where E i(u) e G(:FF l ) and E x (a) < E, and the Boolean subring (generated by 
the Ei(o-) with E as the unit element) is atomic. Z 2 G(M[) is a u-additive 
measure on Z 2 with £j(e) = E, since F x (e) = 1. 

In order to evaluate this condition, we shall first modify the Boolean ring 
Z 2 . It is possible that F,(<r) = 0 for cr # 0. Since F t is a o-additive measure 
and Z 2 is complete, there exists a uniquely determined a s which is the support 
of the measure F x (a) (see IV, Th. 2.1.5, where we need only replace m(a) by 
tr 1 (WiF 1 («r)) where W x is effective). We therefore obtain Fj(ff s ) = 1 and 
Fife) ^ 0 for 0 ^ cr < (T s . Instead of S 2 we use the Boolean ring X 2 of all 
a < a s where <r s is the unit element of S 2 . The observable Z 2 -4 ) may 
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therefore be replaced by the effective observable I 2 L(M\). Z 2 ^ L(J^) is a 
generalized observable since F 2 ((t s ) ¥= 1. Since E^u) < Ej(ff) we therefore 
obtain £(cr) = E(a a cr s ) for all a e E 2 . Therefore we may also consider 
£ 1 (cr) to be a measure on E 2 :1 2 G{i #j) where E x (a s ) = £. 

We will now show that if Wj is effective then the projection £ will be such 
that the map Z 2 '-fi G(M \) is injective, that is, Z 2 can be identified with the 
Boolean ring generated by the E x (o) (where E is the unit element). Thus X 2 
must be atomic. 

In order to show that this is the case, choose an effective W x . The map 
f -2 G( Jtf'j) is injective if and only if it follows that if cr e Z 2 and E x (a) = 0 it 
follows that a = 0. Therefore let E x (a) = 0. Then since (5.4) it follows that 

triMffM) = trx(I V x F x (.o)) 

= t ri (Wj(l - E)F 1 (a)( 1 - £)) # 0 

providing that (1 — £)Ei(cr)(l — £) i=- 0, since W x was assumed to be 
effective. 

On the other hand, since 2 is the support of '/_(a)W u and E x (a) = 0 we 
obtain 


F x (a)) = tr x {[y.(o)W{]E x {a)) = 0 

from which it follows that 

tr 1 ([x(<r)W 1 ](l - EM)) = trMa)W,). 

Suppose that tr x (y(a)W x ) =£ 0, then (5.3a) would fail, and we must have 
(1 — E)F x (o)( 1 — £) = 0, that is, from £,(cr) = 0 it follows that F x (o) = 0. 
Therefore a a u s = 0 and since a e Z 2 it follows that a must be the null 
element of Z 2 . 

The observable Z 2 -> is therefore atomic, and we may replace Z 2 by 
the Boolean ring generated by £,(cr) with£ as the unit element. Therefore, 
instead of Z 2 we need only the atoms of Z 2 , that is, pairwise orthogonal 
sets of £ u which satisfy the following relationship E u = E. 

For (5.4) we now write (where F u = F i(cr £ ): 

£n = E u + (1 - £)£„•(! - £)• (5.5) 

We will now show that for all W x 6 the support of x(o)W x (for all a) 
must be smaller than £.#j. 

We begin by choosing an effective W v Since W x is effective, W x = 2 V w x P <fv 
where w v # 0 and the <p v form a complete orthonormal basis. For E we 
therefore obtain: 


tr(S(Wj x W,)S + ((1 - £) x 1)) = 0. (5.6) 

Thus, for all cp x and for W 2 = and # 0 we obtain: 

0 = tr(S(P„ v x PJS + (( 1 - £) x 1)) 

= tr( P SvvX Sa ~ £) x 1)) = ||((1 - £) x l)S<p v *J 2 . 
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Therefore we obtain 

[(1 -£) X 1 2S(p vXll = 0 

for all (p v and that is, 

S(p v x^EJ^ x jh %. (5.7) 

Conversely, if (5.7) is satisfied for all (p v Xn> then (5.6) holds. From (5.7) it 
follows that 

Scpx^eEJ^ x #C 2 (5.8) 

for all and all Xu - Thus it follows that 

tr(S(PF, x Wj)S + ((l -l)xl)) = 0 (5.9) 

for all W, e K(J%), that is, 

MMeWKl - £)) = 0, (5.10) 

which means that the support of ^(e)FFj is smaller than E:Xf x . Therefore, since 
X(a)W, < xif'-Wi it follows that 

X(a)W x = £\x(a)W^£ (5.11) 

holds for all a. 

The condition (5.3a), with the aid of (5.5) takes on the form 

- E u )) = 0 (5.12) 

for all i—this is equivalent to the condition 

tr(S(FFi x !T 2 )S + ((1 - E u ) x F 2i )) = 0, (5.13) 

where we have used the simpler notation F 2i = F 2 (a ; ) for the atoms a l of S 2 . 

Since (5.13) holds for all W 1 e K(Ml), it follows, as we have seen above, 
that S(px v must, for all (p e and all be eigenvectors having eigenvalue 0 
for all operators (1 — E u ) x E 2i . What is the subspace of all vectors 
(j) e x ^ for which the condition 

[(1 - £„) x F 2i ]<£ = 0 (5.14) 

is satisfied for all z'? 

c t ) can consist only of components T for which (for each z) either 
[(1 - E u ) x l] 1 ? = 0 or (1 x F 2i ) v F = 0. We note, however that [(1 — 
Eu) x 1]T = 0 can hold for at most one z because the E u are pairwise 
orthogonal. Let [(1 — E Uo ) x lj'P = 0, that is, (E Uo x 1)T = T. Thus 
(1 x F 2i )'F = 0 for all i z 0 , that is, only (1 x F 2h ) x F can be nonzero. If 
J% io is the subspace of Jf 2 for which all F 2i for z # z' 0 act as null operators then 
the space of the 4> for which (5.14) is satisfied must be equal to 

^©£i^xjf 2 , (5.15) 

i 

Since 1 = F x (a s ) = F u it follows that 

tr (S(W t x W 2 )S + (1 x X, F 2i )) =1. (5.16) 
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This can be the case only if, for all (p e and all y v the vector S<py v is an 
eigenvector of (1 x £7 F 2i ) with eigenvalue 1. For S(py v = (f> and </)* as the 
components of </> in £ i; J£j x ^f 2; from (5.15) it follows that 

= £ 4> t = (1 x F 2i )(j) = ^(i x F 2i )^ 

i i 

from which we obtain 

<t>i = (1 x f 2,)</>;• 

Let ( 2 * 1 denote the eigenspace of F 2i with eigenvalue 1; then from 
^F 2i =Y J F 2 (,cT i ) = F 2 (cT s )=l 

i i 

it follows that F 2j Jtty = 0 for all j =£ i, and we find that M A 2 > cz ^f 2i . 
Therefore the following relationship must be satisfied: 

S<p Iv eY®E u -^ x jT ( 2 y (5.17) 

i 

for all <p e ^ and all y v . If is the support of W 2 then (5.17) is equivalent to 
S(3tt x X J? W2 ) c X © £ u ^ x (5.18) 

i 

Conversely, if (5.18) is satisfied, then it follows that (5.13) and therefore 
(5.3b) will be satisfied. Certainly we need to guarantee that (5.1) and (5.4) 
must also be satisfied in addition to (5.5), that is, 

tr(S(Wi x W 2 )S + (1 x F 2i )) = tr.WF© 

= tr 1 (Wi£ li ) + - E)F U ( 1 -£)) 1 ' 

must be satisfied. For W x = P (p where (p e F li :^\ the following condition 
must also be satisfied: 

tr (S(P 9 x IF 2 ))S + (1 x f 2i )) = 1, 

that is, (1 x F 2i )S(px v = S(p% v must be satisfied for all y v . From these con¬ 
siderations we obtain the following condition which is stronger than (5.18): 

S(E U M[ x ,yf Wl ) cz E U J% x .j>f ( 2 V for all i, (5.20) 

and 

S(( 1 - £),#, X -/4 2 ) cz £ © E u 3#l x -_r ( 2 V. 

i 

From (5.20) it follows that (5.5) holds. 

The conditions (5.20) are therefore necessary and sufficient in order to 
obtain a measurement of the first kind from S, W 2 and Z 2 L(J^). 

From S(M\ x j^) cz x (]T ; ® J^ 2i y ) it follows that the only portion 
of the observable £ 2 ^ L(j^) which is essential lies in the subspace 
In X; © however, the observable Z 2 L(j^) behaves like a decision 
observable! We may therefore replace it by the decision observable generated 
by the projections £ ( 2 V on • Therefore a measurement of the first kind 
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can only take place if, for all “practical purposes” we measure an atomic 
decision observable for system 2. Nevertheless Z 2 L() need not be a 
decision observable-—it only needs to have the structure described by (5.5). 

If, however Z 2 -4 G(J^) is a decision observable, that is, the F u in (5.5) 
are projection operators, then instead of (5.20) we obtain the following 
necessary and sufficient condition for a measurement of the first kind: 

S(F U J% x c F u x for all t. (5.21) 

For the case of a decision observable S 2 G(^) we define an idealized 
measurement of the first kind to be one for which the relations 

XiaJW, = F U W,F U (5.22) 

are satisfied for the atoms er ; of S 2 and for all W t e K(M\). From (5.22) it 
follows that (5.3a) holds for all a. 

Here (5.22) is satisfied for all W e if it is satisfied for all P v for 

which (p e Since 

= RiKl X FlfWPv X IF 2 )S + (1 x Fl' 2 )] 
it follows from (5.21) that for all cp e F U M[ 

(1 x FU 2 )S(P V x IF 2 )S + (1 x F 2 { 2 ) = S(P V x W 2 )S + . 

On the other side, F u P (f) F u = P 9 . Therefore, according to (5.22) 

P q = RiS(P v x W 2 )S + 

for which we conclude that S(P q x W 2 )S + = P q x W 2 for a suitable W 2 . 
Together with (5.21) it follows that S behaves as an operator of the form 

S = £ F u x A 2i (5.23) 

i 

in the subspace x :W Wl of ^ x M 2 where the A 2i are isometric maps 

The ideal measurements of the first kind are a mathematical example in 
which the mappings T(l, 2; W 2 ) and S(l, 2; W 2 ) can be expressed in a simple 
form. If (5.23) holds then T(l, 2; W 2 ) is given by 

ft = T(l, 2; W 2 )F 2 = X F u tr 2 (W 2 A 2i F 2 A 2i ) 

i 

and S(l, 2; W 2 , F 2 ) is given by 

S(l,2; W 2 , F 2 )W j ='£F u W 1 F u tr 2 (W 2 Ai t F 2 A 2 d. 

i 

The example defined by (2.3.3) and used in §§2.3, 3.2 and 4.2, for the case in 
which Xv = <Pv and the are pairwise orthogonal is a special case of (5.23), 
namely 

S = E p v. x A v where = »fv. 
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The proof that the requirements for a measurement of the first kind and for 
ideal measurements of the first kind do not contradict the fact that the 
scattering operator is unitary does not mean that such scattering processes 
are physically possible because the scattering operator cannot be arbitrarily 
chosen but is a consequence of the interaction Hamiltonian operators. We 
have not shown that the requirements for measurements of the first kind or 
for idealized measurements of the first kind do not contradict the interaction 
description described in VIII. In this connection a series of research papers 
[27] are of interest because they show that conservation laws (such as the 
energy conservation law) already frustrate the possibility to satisfy all the 
conditions for measurements of the first kind, that is, contradict the require¬ 
ments for measurements of the first kind (except for measurements of the 
conserved quantities). Despite this situation, in certain limiting cases these 
conditions can be “approximately” satisfied. 

Although there are no measurements of the first kind for sufficiently many 
observables, this situation does not invalidate the possibility towards making 
measurements at all and is far from invalidating quantum mechanics although 
that impression can be obtained from earlier works which claim that the 
possibility of measurement of the first kind is a crucial issue for quantum 
mechanics. The special consistency of the use of scattering for the purpose of 
registration and preparation in quantum mechanics is assured, that is, does 
not lead to contradictions. The complete consistency of the viewpoint 
presented in II, III with a theory of the measurement process itself is only 
provable if we use a theory of macroscopic systems, a subject we shall return 
to in XVIII, and will be described in detail in [7], 


6 The Physical Importance of Scattering Processes Used 
for Registration and Preparation 

In this book the special consistency problem presented in §§2-4 is of particular 
importance. In fact, we have described physical processes which also play 
an important role in experiments with microsystems, a topic which is largely 
neglected in most books on quantum mechanics. The reason for this neglect 
arises from the fact that most theoretical physicists consider themselves only 
responsible for providing a theoretical description of the “objective structure 
of microsystems”, and that the problem of experimental physicists is to make 
measurements of the theoretically predicted results. Thus, applications of the 
theory described in §§2-4 are, for the most part, only found in books about 
experimental physics, although they will not generally be expressed in terms 
of the abstract formulation presented in §§2-4. Actually, it is possible to fill 
many volumes with such applications. 

Since the description of the measurement process in quantum mechanics 
in terms of “pre-theories” is not possible, the “usual” neglect of a quantum 
mechanical description of the various experimental measurement procedures 
(or at least of portions of them) cannot be justified. Unfortunately, because 
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of size limitations, we will not attempt to provide examples of the approximate 
computation of measurement scattering morphisms, measurement trans¬ 
formation morphisms or transpreparators in this book. In order to illustrate 
the importance of the theory described in §§2-4 for experiments we shall 
provide a few simple practical applications. 


6.1 Sequences of Measurement Scatterings and 
Measurement Transformations 

The gedanken experiment described in §2.1 shows that several of the 
processes described in §§2-4 can be carried out consecutively. In the example 
given in §2.1 first there was a measurement scattering of electrons by light 
quanta, then there was a measurement transformation of light quanta by the 
lens, and finally the registration by the photographic plate. 

Symbolically it is possible to represent such a sequence by means of 
Figure 46. In this diagram 1 represents the microsystems which are to be 
registered. First, these microsystems are scattered by systems 2. Then the 
systems of type 2 encounter systems of type 3 (which may represent an external 
field) for the purpose of undergoing a measurement transformation. The 
systems of type 2 are then registered according to the procedure b 0 . 

In quantum mechanics this process can be represented in Hilbert space by 
the scattering operator 

S = (l t x S 23 )(Si 2 x 1 3 ) (6.1.1) 

(at least with respect to the ensembles used for systems 2 and 3) where S 12 
is the scattering operator in x 2tf 2 for the scattering of systems 1 on 2 and 
S 23 is the scattering operator in .# 2 x j^ 3 for systems (2, 3). From 

tT 2 (W' 2 F' 2 ) = tr 23 (S 23 (IF' 2 x W 3 )SUF 2 x 1 3 )) 
we obtain the measurement transformation morphism 

T(2, 2; W 3 )F 2 = F 2 . 



Figure 46 
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From 

tri (WtFJ = tr 12 (S 12 (Wi x W 2 )St 2 {\ x F 2 )) 
we obtain the measurement scattering morphism 

T(l, 2; W 2 )F 2 = F v 

Thus, from the scattering of 1 by the system pairs (2, 3), using the operator 
(6.1.1), from 

tr 1 (W 1 F 1 ) = tr 123 (S(W\ x W 2 x WyS+0! x F 2 x 1 3 » 
we obtain the measurement scattering morphism 
T(l, 2,3-W 2 ,W 3 )F 2 = F 1 . 

From 

S(W, x W 2 x W,)S + = (li x x W 2 )St 2 x IF 3 ](1, x S 2 + 3 ) 

we obtain the following relationships in a stepwise manner: 
tri 23 (S(Wi x W 2 x PF 3 )S + (1, x F 2 x 1 3 )) 

= tr.asCa, x S 23 [S I2 (Wi X PF 2 )S, + 2 x W,]( 1, x S 2 + 3 Xli x F 2 x 1 3 )) 
= tr 12 (S 12 W x H 2 )S 1 + 2 [T(2, 2; W,)F 2 ]) 

= tr 1 (FF' 1 [T(l, 2; 1F 2 )T(2, 2; IF 3 )F 2 ]). 

Therefore, in this case we obtain 

T(l, 2, 3; W 2 , W 3 ) = T(l, 2; IF 2 )T(2, 2; W 3 ). (6.1.2) 

There is no difficulty extending the above results to sequences of arbitrary 
length, as described by Figure 47. Here we obtain 

T(l, 2 ,...,n;W 2 ,...,W„) 

= T(l, •; W 2 ) T(-, •; W 3 ) • • • T(-, •; 1F V )T(-, •; W v+ ,) • • • T(-, •; W n ). 

(6.1.3) 

This result has played an important role in the historical development of 
quantum mechanics. It is possible to separate the chain in Figure 47 any¬ 
where (i.e. between the vth and (v + l)th) and consider the remainder as the 
measurement. Of course the fact of the possibility of separation was not 



2 3 4 v v + 1 n 


preparation procedure registration procedure 

Figure 47 
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Figure 48 

historically proven by equation (6.1.3). From this equation it is trivial to 
prove that it is possible to divide such a chain arbitrarily into preparation 
and registration (see Figure 47) and remain consistent with quantum 
mechanics. Here we do not intend to dwell on the “special consistency” in the 
remainder of this section. Here we only intend to symbolically represent the 
experience of experimental physicists by means of pictures of the form of 
Figure 47. 

In addition to the registration apparatus at the end of the chain, other 
registration apparatuses may be used at other points in the chain, as illu¬ 
strated by Figure 48. The three apparatuses b 0i , b 02 ,b 03 in Figure 48 can be 
considered to be a'registration method b 0 with respect to the systems of type 1 
entering from the left. If we identify these three methods by three observables 

I, L(J^), £ 2 £3 L(J4? 3 ), we then obtain an observable 

£ L(J^) as follows: 

tr a a 2 a <r 3 )) = tr 123 (S 13 [S 12 (IF 1 x W 2 )St 2 x W 3 ] 

■Si 3 (F(° 1) x ^2(0-2) x F 3 (a 3 ))) 

which is defined on the free Boolean ring £ generated by £ x , £ 2 , £3 (see AI, §3). 

These examples suffice to show the multiplicity of such sequences for the 
construction of registration methods. 

6.2 Physical Importance of Measurement Scattering and 
Measurement Transformations 

We shall now exhibit examples of measurement scattering and measurement 
transformations. These examples only serve to illustrate the meaning of 
these well-known experimental procedures under this viewpoint. Further¬ 
more, it is possible to construct more complicated apparatuses from individual 
known experimental procedures which can be used for experiments. 
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The use of “external electromagnetic fields” for the purpose of the measure¬ 
ment transformation is extensive. An example is the use of electric and 
magnetic fields for the deflection of a beam of charged particles for the 
“determination of momentum” and the measurement of the quantity e/m. 
We will now illustrate the latter experiment. 

We will use the form of the Hamiltonian operator given by VIII (6.31). We 
will assume that we may produce ensembles which do not have broad 
position and momentum dispersion and that we are dealing with fields which 
are not varying rapidly in order that the time dependence of the expectation 
values of position satisfy Newton’s equations of motion for a mass-point: 

mr = e(E + r x B). (6.2.1) 

Derivations of this equation from quantum mechanics can be found in many 
of the quantum mechanics textbooks; see, for example, [6], XI, §1.2. 

After the interaction with the field has taken place, it is possible to measure 
the effect associated with microsystems striking the surface F (see XVI, §6.1). 
From (6.2.1) it follows that the quantity e/m can be determined from these 
measurements. A precise description of such an apparatus cannot be given 
here. 

In other well-known apparatuses, fields are used for the determination of 
the energy. Other examples of the application of fields are the electro¬ 
magnetic lenses used in electron microscopes. We also draw attention to the 
Stern-Gerlach experiment in which an inhomogeneous magnetic field is 
used in order to transform the effect of striking a surface into a measurement 
of spin. Although this example is relatively simple, its theoretical description 
in terms of quantum mechanics is already complicated and can only be 
carried out in an approximate manner (see, for example, [6], XI, §7.2). A 
comparison of such a theoretical description of the Stern-Gerlach experiment 
with the description given by the experimentalists (which is also theoretical) 
will be very informative. It will show the skills required by the experimentalist 
to construct with minimum possible work a usable (but not better than 
necessary) approximate theory for the description of the processes in his 
measurement apparatus. The bending of the beam in the external magnetic 
field in the Stern-Gerlach experiment is described in a form which is simplified 
as much as possible in order to exhibit the “connection” of the splitting of 
the beam with the spin and magnetic moment of the atoms. This “connection” 
is nothing other than what we have called a measurement transformation 
morphism. 

Scattering by other microsystems will frequently also be used (even in cases 
in which the complications of the theoretical description are extensive) for 
the purpose of measurement transformations, for example, in order to 
measure the “polarization” (spin direction). 

It is impossible to enumerate all examples in which measurement scattering 
is used. We shall only mention a few examples: Scattering of atoms or 
molecules (systems 1) by electrons (systems 2), and the subsequent registra¬ 
tion of the electrons which are interpreted as measurements of the atoms or 
molecules. An example of historical importance is the Frank-Hertz experi- 


7fttt£fccj<t giumf Pfcy iioi 



6 The Physical Importance of Scattering Processes 345 


ment in which the energy levels of atoms are measured. Here, in addition to 
the scattering of the electrons, a deceleration field is used as a measurement 
transformation. Scattering of neutrons (systems 1) by atomic nuclei (systems 
2) are used to register neutrons with the aid of effects associated with the 
atomic nuclei. 

Measurement scattering and, in particular, measurement transformations 
are often used in subtle ways in order to increase the accuracy of experiments, 
even though the final registration cannot be pfecisely described. The example 
given in §2.1 of the microscope is a good illustration. The fact that the process 
of the sensitization of the silver bromide crystals in a photographic plate is 
not well Understood and the imprecision in the location of the silver grains 
is unimportant because the theoretically derived resolution of the microscope 
yields the essential imprecision if the magnification is sufficiently great. Then 
we may replace the effect of the blackening of the silver bromide crystals by 
the idealized effect of the incidence of a photon on a surface element of the 
plate. 

Despite all the “tricks” in the application of measurement scattering and 
measurement transformations, the following problem remains for the 
experimental physicist: the fact that microsystems must finally be registered 
by a macroscopic registration apparatus, that is, by actual processes on the 
macroscopic system. The technological possibilities of such macroscopic 
registration apparatuses are very limited. By the use of measurement 
scattering processes and measurement transformations it is possible not 
only to improve the precision of measurements but also to essentially increase 
the set of experimentally usable registration procedures. 

This increase in the possibilities for measurements is not only important 
from the experimental point of view but it is also indicative of a theoretically 
important problem, which we cannot ignore in this book because of its 
relevance for our description of quantum mechanics. 

One of the unpleasant aspects of the theory presented in XVI-XVII is that, 
for certain effect procedures the corresponding effect operators must be 
“guessed” and then asserted as axioms. Since quantum mechanics does not 
permit direct measurements (that is, measurements which may be described 
in terms of pre-theories alone—see I and [8], §10), there was no other way 
from II to XVI. 

There are, however, two decision observables, which are obtained in a 
purely theoretical way from the physical interpretation of Galileo trans¬ 
formations for elementary systems: Position and Momentum. Clearly the 
structure of the corresponding apparatuses is not theoretically determined 
(see IV, §4). It is, however possible to test whether an apparatus is built in 
such a manner as to satisfy the claims for measurements of position or 
momentum. In this sense the position and momentum observables of 
elementary systems can be considered to be observables within the frame¬ 
work of the theory developed in II-X. 

As we have seen above we may obtain new registration procedures by the 
concatenation of measurement scattering processes and measurement 
transformations. Such new registration procedures may be totally described 
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only by quantum theory (using the interaction Hamiltonians in VIII) if the 
final registration in the chain is a registration of the position or momentum 
of an elementary system (or of an impact of such a system on a surface 
2F described by XVI (6.1.13)). What is the subset L r (^) c L( ) which can be 
theoretically achieved by such chains? In terms of the theory there is no need 
for additional axioms about concrete effect procedures and the corresponding 
effects F e Lpf), providing that one is satisfied with the effects of this subset 
L r (Jf). We could be more completely satisfied if we could show that the 
subset L r (Jf) so obtained is sufficient to approximate every physical effect 
of L{:Fff ). We are, however, far from a solution of this problem. 


6.3 Chains of Transpreparations 

The total sequence illustrated in Figure 48 can also be considered to be the 
preparation procedure for the final ensemble entering into the apparatus b 0 . 
If we use the observables X 2 H X 3 5 L(.^j) for the description of the 

registration methods b 02 , h 03 , then the sequence illustrated in Figure 48 can 
be considered to represent a transpreparator X 23 4- IT, where X 23 is the 
Boolean ring freely generated by X 2 , X 3 and y is defined by 

1 (a 2 a <r 3 ) = S(l, 3; W 3 , F 3 (cr 3 ))S(l, 2; W 2 , F 2 («t 2 )). (6.3.1) 

Here it is again trivial to write transpreparators for arbitrary long sequences. 
In addition, it is possible to pass over from systems of type 1 to systems of 
another type by scattering, and consider the entire chain to be a preparation 
process by which systems are obtained from the final link in the chain. In 
addition, individual elements of the sequence may consist of the use of 
external fields. In the latter case, the transpreparator takes on the following 
form: 

SW = SWS + , 

where S is the scattering operator for the external field. 

According to quantum mechanics it is, in principle, possible to compute 
the transpreparator for complicated sequences. In order, however, to obtain 
a preparator, it is necessary that the beginning of such a sequence be a 
macroscopic apparatus. If such is characterized by a preparator X x K(M[), 
then with the aid of the transpreparator y defined by (6.3.1) we obtain a 
preparator X X where X is the Boolean ring freely generated by 

X t , X 2 ,X 3 and rj is given by 

ri(ff 1 a <r 2 a a 3 ) = y(<x 2 a <x 3 >h(ffi)- (6.3.2) 

If we have a longer sequence, then we can cut the sequence at an arbitrary 
place, and interpret the “left-hand part” as the “starting preparator” 

Xi ^ £(J*0- 
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6.4 The Importance of Transpreparators for the 
Preparation Process 

It is possible to fill volumes with the description of preparation apparatuses 
which are used for transpreparators and also sequences of transpreparators. 
Again we must be satisfied by mentioning a number of applications demon¬ 
strating the importance of transpreparators for experimental physics. 

The great accelerators are probably the best-known examples of prepara¬ 
tion apparatuses in which external fields, often in subtle form are used for 
transpreparations. In other apparatuses scattering of microsystems is often 
used in order to generate other system types, for example, certain ions, 
neutrons and short-lived radioactive nuclei. 

In §6.2 we have noted that experimental physicists have a knack in choosing 
suitable measurement scattering processes and measurement transformation 
processes such that the imprecise theoretical understanding concerning the 
last registration apparatus will be meaningless. Similarly experimental 
physicists suceed with the aid of transpreparations, to obtain a theoretically 
well-defined ensemble even though at the beginning of the sequence we began 
with a poorly defined ensemble. 

Transpreparators such as measurement scattering and measurement 
transformation morphisms are essential items for the verification of quantum 
mechanics since in practice we have very few macroscopic apparatuses which 
either product microsystems or finally register them. 

Is it possible (by analogy with the introduction of position and momentum 
observables) to theoretically define certain initial ensembles from the theory 
developed in I-X (without the need for the inelegant axioms which were 
defined, for example, in XI, §2) in order that we obtain the largest possible 
subset of K by use of the transpreparators? Such an implicit definition 
of ensembles (such as the implicit definition of position and momentum 
observables) is not currently available. If we would have a sufficient large 
subset L ex c L of effects such that L ex separates the ensembles (that is, 
K w i, 9) — K w 2 ’ 9 ) for a ll 9 e L ex implies that w 1 — w 2 ), then it would be 
possible to test the produced ensembles with the aid of the effects in L ex . 

From this viewpoint it is understandable that in most presentations of 
quantum mechanics the primary emphasis is not in the preparation problem 
but in the measurement problem because in making measurements it is 
always possible to test ensembles. This testing can, in many cases, be very 
simple. Indeed, in many cases it suffices to test an ensemble using a single 
effect F in order to determine W. 

It is clear that mixtures of ensembles are not as well suited for the purpose 
of experiments as the components of the mixtures (see III, §2). For this 
reason experimental physicists generally do not use direct mixtures (III, D 2.3) 
of preparation procedures. 

The most desirable ensembles from an experimental physicist’s point of 
view will be those which correspond to the extreme points of the set K (see 
III, D 6.1 and III, Th. 6.5) and are called “irreducible ensembles” or “pure 
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states”. Here we shall avoid the expression “pure state” because this notion 
is often misunderstood—it gives the impression that all systems of a pure 
state are “identical”. (This notion leads to peculiar difficulties and to problems 
in logic.) Elements x of a preparation procedure a for which <p(a) is an 
irreducible ensemble can be distinguished by the effects produced by them, 
that is, by the registration procedures b of which they are elements. Micro¬ 
systems are not physical objects (see IV, §8) and therefore cannot be separated 
into “objective identical” categories. 

Experimentally, the extreme points of K have the desirable property that 
they can be tested by means of a single effect. Suppose that we have con¬ 
structed an effect procedure ( b 0 , b) for which ij/(b 0 , b) = P v (see III, Th. 6.5), 
then we may easily test whether a WeK is equal to P q because from 
tr(IVP 9 ) = 1 it follows that W = P, r In such cases we do not need many 
effects in order to test W. Therefore, if we can control the registration, we may 
then easily test whether we have prepared an irreducible ensemble. How then 
may we prepare such irreducible ensembles ? 

Clearly with the aid of scattering processes and registration procedures, 
that is, with well-chosen transpreparators! If we control the registration, 
then we may seek a scattering process and a particular observable Z 2 L(-/f 2 ) 
such that, for certain elements a t of X 2 , the ensembles W u given by 

W u = S&2;W 2 ,F 2 (od)W 1 (6.4.1) 

are irreducible ensembles. Of course, such a selection will unfortunately not 
only depend upon the choice of the observable X 2 L(J4? 2 ) and the scattering 
process but also upon the ensembles W 1 and W 2 . The advantage of such an 
application of (6.4.1) can be the fact that the methods for the production 
of ensembles W u W 2 are already known, but that there is no method for the 
production of the desired irreducible ensemble W u . For example, it is easy 
to produce atoms (as systems 1) in the ground state. By scattering with 
electrons (system 2) it is possible to excite the atoms. By choosing W 2 to be an 
ensemble which does not have much dispersion in the energy (for example, by 
accelerating the electrons in an external field), then by measuring the energy 
of the electrons after the scattering (that is, the observable X 2 L(Jf 2 )) 
we may “determine” into which state the atom is excited, that is, we can select 
an ensemble W u of atoms in a particular excited state. 

We have presented this example in order to make it clear that the trans¬ 
preparations are not generally associated with registrations in the sense of 
measurements of the first kind (§5). 


6.5 Unstable States 

Unstable states are frequently found in the realm of atomic dimensions. The 
most obvious example is that of radioactive atomic nuclei. The “excited 
states” of atoms and molecules described in XI-XV are also unstable states, 
which undergo transitions by the emission of photons. 
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There is an extensive literature devoted to unstable states [28], Since 
somewhat complex interactions are required in order to investigate unstable 
states, it is understandable that the study of unstable states has lead to a great 
variety of approximation methods. We shall not attempt to describe these 
methods in this volume. 

An attentive reader of the literature will, despite the matters of detail, 
readily become aware of the fact that there are deep conceptual uncertainties 
associated with the quantum mechanical treatment of unstable states. Indeed, 
the concept of an “unstable state” has not been precisely formulated. This is 
precisely the point where we must use the theory presented earlier in order to 
place this problem in correct perspective. 

At first we ask whether “unstable state” refers to the notion of state 
defined in III, D 1.1. In this volume we have often used the expression 
“ensemble” instead of “state”. Since this problem is indeed concerned with 
“certain” ensembles, the problem of “unstable ensembles” is a preparation 
problem. 

Many of the attempts to solve this problem result from the widespread 
notion that the observables and states are attributed to the miscosystems, 
avoiding the fact that the problem of unstable states must be a preparation 
problem, that is, a problem of producing ensembles with the aid of prepara¬ 
tion procedures. Often physicists are unaware that the production of such 
ensembles is intimately tied up with the preparation process and cannot be 
separated from the latter. The usual approach is to seek a formulation of 
quantum mechanics, the concepts of which should be defined only in terms of 
self-existing properties of microsystems—properties which are detached 
from all apparatuses. As we have already seen, such a formulation of quantum 
mechanics is unrealistic. Any attempts in this direction must ultimately fail. 
In this book we have emphasized the fact that it does not make sense to define 
observables without taking into account the apparatus. Despite this we 
have often “guessed” the effect operators without knowing the detailed 
structure of the apparatus but using only a rough knowledge of the mode of 
operation of the apparatus. In this way we have, for example, guessed an 
approximate operator for the effect of the impact of a system on a surface 
(see XVI, §6). 

If, after considering the examples from quantum mechanics described in 
XI-XVI, we remain unaware of the fact that effects such as ensembles for a 
given procedure are “guessed”, then we may easily be deluded into believing 
that these “guessed” quantities are objective quantities (that is, independent 
of the procedures used) and that the experimental physicist has to find 
apparatuses which measure these quantities. If, on the other hand, we are 
aware, that it is impossible to separate the concepts of an observable and a 
state from the procedures used, then we will not be under the illusion that it is 
possible to introduce the notion of an “unstable state” independent of the 
preparation procedures. 

If we consider the experimental situations in which unstable states occur 
then we discover how easy it is to produce such ensembles. In contrast to this, 
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it appears to be theoretically difficult to specify preparation procedures for 
unstable states and their corresponding operators We K. One frequently 
used possibility for the preparation of unstable states is by “irradiating” 
microscopic systems, that is, by using scattering processes. 

Scattering processes have already been extensively discussed in XVI. 
Should we have already treated unstable states there? Yes and no. Yes, in the 
sense that we could have already considered the more complicated structures 
associated with scattering operators than we have already done. No, because 
it is not as simple as in the case of the introduction of the “effect of micro¬ 
systems striking a surface” to discover the operators which accurately 
describe the experimental process of the transpreparation for unstable states. 
Intuitively the procedure of irradiation can be roughly described as follows: 
After the irradiation, some of the miscrosystems are left in unstable states. In 
more detail, after the scattering process we are left with an ensemble of 
microsystems, which is generally a mixture which can be decomposed into 
components, one of which is that of an unstable state. The latter is made 
evident from the fact that, at a later time, microsystems are emitted, usually 
obeying an experimental decay law: The number of nondecayed systems is 
given by N(t ) = JV(0)e -(/r ; the quantity x is called the mean lifetime. 

In order to provide the principal aspects of this problem we shall consider 
a simple example, For system 1, which we will “irradiate” with systems of 
type 2, we shall consider a composite system consisting of two elementary 
systems, one of which is so massive that we may describe it by means of an 
external potential field V(r). We shall assume that V(r) has the form as given 
in Figure 49. We may then describe system 1 as an elementary system moving 
in the potential field V(r). Systems of type 2 which are used to irradiate may 
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have a repulsive interaction W(r l2 ) with system 1, and will have no inter¬ 
action with the “heavy” system which produces the field V(r). The 
Hamiltonian will therefore be given by: 

+ (6.5.1) 

We shall assume that the operator H 10 = + V(r t ) has a single 

discrete nondegenerate eigenvalue — E 0 with corresponding eigenvector 
<Po(ri) which we shall call the “bound ground state”. We shall assume that the 
initial ensemble has the form x W 2 . Therefore in the Heisenberg picture 
we have to set W = Q _(£,,„ x W 2 )Qt (see VI, §5). We assume that the dis¬ 
persion in energy of W is much smaller than E 0 , and that the mean energy is 
greater than V m + E 0 where V m is the maximum of V(r). Then 

W=fi_(P„ 0 x W 2 )Qt 

describes the “elastic” scattering of system 2 as well as the inelastic scattering. 
In elastic scattering system 1 remains in the ground state; in an inelastic 
scattering system 1 will be expelled and can be detected far from the scattering 
system by a detection surface. 

In an inelastic scattering it is possible that a “nearly bound state” of energy 
£, (see Figure 49) exists in which system 1 may be found long after the 
scattering. This is an example of an unstable state. E 1 would be a bound state 
if the potential V(r ) does not sink below E 1 for large r. Here we speak of the 
“tunnel effect” in which system 1 “tunnels” through the “mountain” of V(r). 
A classical mass-point of energy £, cannot climb over the mountain, that is, 
is “imprisoned” by the potential, even in the case in which the potential 
drops below E l for large r. 

How is it possible to prepare an ensemble corresponding to the “unstable 
state” having energy E 1 intuitively described above? 

We can detect the systems 1 and 2 after scattering by the effect of striking a 
spherical surface of radius R. Let T denote the time before which all the 
systems of type 2 have struck the surface. The experiment shows that systems 
of type 1 strike the surface after T and that they have nearly the energy E u 
and that the number of system 1 striking the sphere per unit time decreases 
with a time dependence e ~ tn . In this experiment R must not be too large. The 
following condition must be satisfied: 

R< (6.5.2) 

m 2 

where k x is defined by £, = k\/2m u that is, k 1 /m 1 is the mean velocity of the 
systems 1. 

Experimentally the “unstable state” of systems 1 can be singled as follows: 
All systems of type 1 are selected where the systems of type 2 have lost more 
energy than £ 0 and have struck the sphere before time £ but none of the 
systems of type 1 have struck the sphere. This experimental situation charac¬ 
terizes a preparation procedure a for which (p(a)eK(Jtf [) determines the 
so-called “unstable state”. How do we determine the operator <p(a)? 
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First, on a purely quantum mechanical basis, from the Hamiltonian 
operator H (6.5.1) it is possible to compute the operation T c which maps 
P„o x ^2 on t° a W e x .#>), which, for large distances r 2 is identical 

to the outgoing waves for the inelastic scattering. 

Using the expressions in XVI, §6.2 for the elfect of the systems striking a 
sprface, we may calculate the probability that systems 1 will strike the 
spherical surface in the time interval t x to t 2 . For t > T this probability 
should approximately have the time dependence e~' /r . T c does not, however, 
determine W = cp(a ) with the above defined a. We need an additional 
operation T which maps T c (P qil x IF 2 ) onto a # e K(M\) and corresponds 
to selecting the cases in which no system 1 strikes the surface up to time T. 
The knowledge of the “guessed” operator for the effect of striking of system 1 
up to time T is not sufficient in order to determine T. In order to compute T 
we need a theory of the interaction of the microsystem with the material of 
the real sphere, that is, with a macroscopic system. Therefore it is very 
understandable that we avoid this physical problem by making some 
idealized postulates about T. We have to ask whether such postulates are 
realistic. We shall later show how we may, by guessing, obtain completely 
unrealistic results. 

As we have seen above, the problem of obtaining a preparation procedure 
for “unstable states” clearly shows the following: First, that it is not possible 
to define unstable states by means of a limiting process as we have done for 
the case of scattering, because, in the limit t -* oo nothing remains of the 
unstable states. Second, there is no unique definition because the unstable 
states prepared depend precisely on the preparation procedure a that was 
used. The physical distinction between such ensembles produced by different 
a’s is so minimal, that any definition method for unstable states is admissible 
(for example, on poles in the complex plane of the scattering amplitude) if it 
is practical and we obtain the physical same results as for the different 
experimental preparation methods a. 

In the above example <p(a ) does not, for all practical purposes, depend upon 
R and T if (6.5.2) is satisfied and R and T are neither too small or too large. 

Providing that we realize that there are only approximate methods for the 
definition of unstable states, we will be able to avoid (as mentioned above) 
making errors in the guessing of T. Now we wish to demonstrate an error 
which results in a misunderstanding of quantum mechanics in that measure¬ 
ments are decisions on propositions rather than about processes which must 
be described in physical terms. In this misunderstanding it is believed that 
ideal measurements of the first kind are the only “true” measurements, that is, 
those which correspond to decisions on propositions. Then it is believed that 
all measurements in physics must refer back to ideal measurements of the 
first kind. Then it becomes possible to believe that a measurement that a 
system which has not passed through the spherical surface in the time interval 
t to t + At can be idealized to a decision that if the system is in the sphere at 
time t it will also be there at time t + At. If E(f) is the projection (in the 
Heisenberg picture) for the decision effect that the position at time t is within 


Pniie. 7lltt£fccj<t giumf Pfcy iioi 



6 The Physical Importance of Scattering Processes 353 


the sphere, then the idea that the particle does not pass through the sphere 
during the time tj to t 2 has to be replaced (with At = (t 2 — f j/n) by a 
segment of a subsequent ideal measurement of the first kind that “the position 
ofthe particle at all times t + mAt(m = 0,1,... ,n) must lie within the sphere”. 
Then the corresponding operation will, according to §5 be given by 

W-+ E(t 2 ) ■ ■ ■ E(t 1 + At)E(t 1 )WE(t 1 )E(t 1 + At) 

• £(f + 2At) • • • E(t 2 ) (6.5.3) 

We may prove that, in the limits At ->■0 (that is, n ->• oo) from (6.5.3) we 
obtain an operation T t2i tl for which 

m 2 , tl W tI ) = tr (£(t 1 )»J I £(t 1 )) (6.5.4) 

holds. If the system is in the sphere at time t, then it will be there at all other 
times—it cannot come out. The probability that the system is in the sphere is 1 
at all times, although it must, according to experience, decrease according to 
e -r/t . If we set T = T (2j(i for sufficiently small t t and sufficiently large t 2 , 
then we will be in contradiction with experience. Apparently we have imposed 
idealized requirements upon T which are absolutely unrealistic, since these 
requirements implicitly presuppose that the spherical surface reflect the 
systems 1 instead of (in actual experiments) absorbing them and detecting 
them. Unfortunately, this situation is described as a paradox, although it is 
not one. The fact that an idealized registration defined by means of imposing 
requirements is very different from an actual registration of the incidence of a 
system has nothing to do with a paradox but only shows that it is physically 
unrealistic to describe all experiments in terms of decision effects and of ideal 
decision measurements of the first kind. 

That (6.5.4) holds can be proven exactly [29]. We shall only make (6.5.4) 
plausible. From 

E(t + At) = £(t) + iAt[FIE(t) - E(t)H) + At 2 ■■ ■ 
it follows that 

E(t + At)E(t)WE(t)E(t + At) 

= E(t)WE(t) + iAt[HE(t)WE(t) - E(t)HE(t)WE(t) 

+ E(t)WE(t)HE(t) - £(t)W£(t)H] + At 2 • • •. 

From which we obtain 

tr (E(t + At)E(t)WE(t)E(t + At)) = tr(E(t)WE(t)) + At 2 ■ ■ ■. 

In the limit n-* oo, for n steps, we obtain (6.5.4). 

The fact that for a realistic treatment of the preparation of unstable states 
it is necessary to introduce an absorbing spherical surface can be experi¬ 
mentally verified as follows: Suppose, for example, we have an excited atom 
in the vicinity of a dielectric surface (a macroscopic system which does not 
absorb photons), then the lifetime t, that is, the transition probability for the 
emission of a photon is changed. An unstable state is always characterized by 
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an absorbing neighborhood, which, must, of course, be taken into account. 
This makes it “easy” to produce unstable states. 

The reader may compare the above considerations to the classical absorber 
theory of Wheeler-Feynman [30] and its application to quantum mechanics 
by Siissmann [31]. 


7 Complex Preparation and Registration Processes 

In previous discussions (§§2-6) we described processes for which there is a 
scattering operator. For more complicated processes the mathematical 
description is more difficult. For all collision experiments, however, there 
exists a wave operator which maps the Hilbert space of the incoming 
channels into the Hilbert space of the interacting total system. In place of 
(2.2.2) we then have the equation 

w = a.(w t x w 2 yat, (7.i) 

where we must pay attention to the fact that in the formula 

tr (WF) = tr(Q_(Kj x W 2 )QZF) (7.2) 

the time dependence of F must be taken in the Heisenberg picture with 
the entire Hamiltonian operator (including the interaction) (see XVI, §4.9). 

• By analogy to the discussion in §2.2, it follows that 

tr(Q_(fkj x W 2 )SltF) = tr^WiFi) (7.3) 

defines a B-continuous effect morphism L(Jf) -> Lp£j). Since Q_ is iso¬ 
metric, it is easy to verify that for W 2 = P#, this effect morphism is surjective 
(choose W l to be an arbitrary P tp and set F = P n w ,), that is, no information 
possibilities are lost. Only in the case in which W 2 is not irreducible is it 
possible to lose information possibilities. 

Scattering of the most complicated type can therefore be used for the 
purpose of registration. Thus, it is not necessary to introduce any new con¬ 
siderations other than those previously introduced in §§2 and 3. 

This short discussion was introduced in order to more easily realize the 
manifold ways in which scattering processes described in quantum mechanical 
terms may be used in order to obtain new possibilities for registration of 
effects by using “known” registration and preparation processes, and to 
invent new preparation possibilities by the use of “known” preparation and 
registration possibilities. 

All the above proves only the “special consistency” of the measurement 
problem with quantum mechanics. The “general” consistency problem, that 
the preparation and registration procedures (as presented in II) are, in 
general, consistent with quantum mechanics, is much more difficult. This 
problem will be discussed in XVIII, §3, and solved in [7]. 

Readers interested in problems described in XVII are referred to the 
book of K. Kraus [42]. 





CHAPTER XVIII 


Quantum Mechanics, Macrophysics and 
Physical World Views 


In closing this book, we shall now recall the route of the development of 
quantum mechanics in II-XVII in order to come to some judgment con¬ 
cerning the place of quantum mechanics in physics as a whole and to discuss 
remaining open problems which cannot be examined in this book. We will 
consider the question whether it is necessary to consider quantum mechanics 
as a revolution in the world view of physics. No other theory has given rise to 
more varied and peculiar notions for the meaning of a physical theory. No 
other theory has produced as much philosophical discussion. 

1 Universality of Quantum Mechanics? 

A more or less known decisive preliminary decision concerning the domain 
of validity of quantum mechanics has played an important role in discussions 
about the philosophical meaning and the physical consequences of quantum 
mechanics. It is the preliminary decision that quantum mechanics is, at least 
in its fundamental principles, universal, that is, is valid for all physics, and 
that all other nonquantum mechanical theories, such as classical, are only 
approximation theories relative to quantum mechanics. 

The belief in the universality of quantum mechanics has its historical 
origins. Despite this, it is necessary to examine this idea with a critical and 
sceptical eye. 

In the beginning of quantum mechanics after a tentative investigation 
towards a comprehensive interpretation of quantum mechanics physicists 
were lead to the usual statistical interpretation based upon the fundamental 
concepts such as “state” and “observable.” We will not attempt to investigate 
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all the more or less precise ideas which have led to these two concepts. In 
practice everything functions well if we are sufficiently practiced, that is, we 
have a certain familiarity of these fundamental concepts. 

This “familiarity” often lets us forget that quantum mechanics formulated 
in this way was not a correct SfiST (see I and [1]) because it did not give any 
indication what observables mean and in what state microsystems are to be 
found. Did quantum mechanics really make no indication about these? 
Certainly it did—with the aid of the correspondence principles. For quantum 
mechanics the correspondence principle was not only a means by which we 
can guess the route to quantum mechanics but was also a part of the quantum 
theory itself, a fact which was clear to more perceptive physicists (see, for 
example, [32]). Indeed, at the beginning of quantum mechanics there was 
nothing left over but to discover the quantum mechanical observables from 
the classical observables using the correspondence principle. Here it was 
unconsciously hoped that the classical mechanical observables, the meaning 
of which is already known, will also determine the meaning of the corres¬ 
ponding quantum mechanical operators (according to the correspondence 
principle). Physicists hoped that the unknown measurement methods for 
the quantum mechanical observables will, in principal, be determined by the 
known measurement methods for the corresponding classical observables. 

Many physicists were so accustomed to this situation that they were often 
unaware that every new operator obtained from the correspondence principle 
(for a measurement method for an experiment) is more correctly a new axiom, 
and hence represents an extension of quantum mechanics to a more com¬ 
prehensive theory and that the correspondence principle does not give a 
measurement method for these observables. Only the more critical physicists 
recognized this gap in quantum mechanics. What is then the general validity 
of quantum mechanics, that is, the universality of quantum mechanics if we 
always must rely upon the use of the correspondence principle and unknown 
measurement methods ? 

Bound up with the ideal that the concepts of “state” and “observable” 
refer only to “micro-objects,” and that measurements consist of the deter¬ 
mination of observable properties of these micro-objects, many physicists 
have not viewed as a defect the need to guess the operators and their ignorance 
of the measurement methods. The latter were regarded as a purely experi¬ 
mental problem. The physical meaning of, for example, the “position,” the 
“momentum,” the “angular momentum,” the “energy” did not seem to be 
questionable. In “practice” the quantum mechanical description appeared to 
function, indeed, even in macroscopic physics, witness the great success of 
quantum mechanics in the explanations of the properties of macroscopic 
systems, particularly for such curious phenomena such as superconductivity 
and superfluidity. Are these successes not a proof of the universality of 
quantum mechanics? Should we not assume that all theories of macroscopic 
systems are limiting cases for the more exact theory of quantum mechanics? 

Although these successes may make the idea of universality of quantum 
mechanics more compelling, it is important to note that these successes are 
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not a proof for the validity of the fundamental principles of a theory. The great 
success of classical mechanics had led to the notion that “all” of physics can 
be reduced to complicated mechanical motion, an idea which, for example, 
Laplace has expressed in his tract on probability theory (see, for example, 
[6], VI, §4). And, of course, as we now know, physics cannot be reduced to 
mechanics. 

Since the adherents of the universality of quantum mechanics do not 
justify their position only on the successes of quantum mechanics, they have 
presented other arguments. Since the author of this book is not an adherent 
of the universality viewpoint, he would not be a suitable advocate for the 
universality position. For this reason, we refer interested readers to the 
original literature (see the extensive bibliography in [33]). 

Even if the adherents of the universality of quantum mechanics avoid the 
problem of elaborating the bounds (limits) of quantum mechanics, they 
necessarily introduce a new difficulty, namely, the need to answer the 
following question: How do we obtain a “valid determination” of measure¬ 
ment results? Here again we find a variety of different approaches, which 
range from attempts to show that probability theory itself gives the valid 
determination to the introduction of consciousness of the observer or to the 
so-called “many worlds” interpretation of quantum mechanics. For such 
attempts we refer readers to the original literature [33] and [34], 

We now recall that the formulation of the foundations of quantum 
mechanics given here began with the notions of preparation and registration 
procedures, and makes no claim of universality because quantum mechanical 
systems are not physical objects in the sense of III, §4.1 (see also IV, §8.1), 
that is, they cannot be objectively described while the selection procedures 
themselves require an objective mode for their description! Mathematically 
this is expressed in the fact that the structure underlying the notion of a 
selection procedure is very similar to that of a Boolean ring while the structure 
of species pseudoproperties is only an orthocomplemented (not Boolean) 
lattice (see III, §4 and IV, §8). Here, are we not placed in the position in which 
the use of selection procedures as our starting point is made doubtful 
because the apparatuses used for preparation and registration are built of 
atoms and therefore must be properly considered to be quantum mechanical 
systems and are therefore not physical objects? Must we therefore switch 
over to the universality notion for quantum mechanics which we have just 
rejected and seek another basis for the foundations of quantum mechanics? 

2 Macroscopic Systems 

We are of the opinion that the foundations of quantum mechanics presented 
here yields a true picture of reality. Instead of finding it necessary to introduce 
fantastic ideas in order to answer the question about the validation of measure¬ 
ment results, we must, on the basis of our requirements about a SPJ' (see I 
and [8]), examine the following actual physical question: How is it possible, 
on the one hand, to objectify the macroscopic systems, that is, how we are to 
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describe them as physical objects and, on the other hand, apply quantum 
mechanics to' macrosystems and be successful in explaining their properties 
(properties in the sense of II, §4.1, not pseudoproperties!)? 

In the usual formulation of quantum mechanics we begin with the funda¬ 
mental concepts of state and observable and note the existence of non- 
commensurable observables as part of the fundamental structure. This 
fundamental structure is determined by the lattice of closed subspaces of a 
Hilbert space, and is often called a quantum logic. In our formulation of 
quantum mechanics the derived concepts of state and decision observable 
are defined in terms of the mathematical description of preparation and 
registration procedures as presented in II. In the usual formulation the 
description of experiments, that is, of preparation and registration, can only 
be done in terms of ordinary language. Therefore it is understandable that 
many physicists consider the notion of “quantum logics” to be more funda¬ 
mental than the “objective” logic of preparation and registration. Do we 
have any experimental evidence that there exist noncoexistent observables 
for macrosystems? On the contrary, is it not the case that the so-called 
fundamental structure of quantum mechanics, the existence of noncom- 
mensurable decision effects has not been experimentally confirmed for 
macrosystems ? Conversely, is it not the case that one of the typical aspects 
of macroscopic systems is that only coexistent observables can be measured ? 
What, then, are the great successes for the application of quantum mechanics 
to macrosystems ? 

First we may question whether the existence of noncommensurable 
observables has been experimentally proven for the case of microsystems. 
This proof has already been given in XVII. The most transparent case, 
although a somewhat idealized gedanken experiment, has been exhibited in 
XVII, §§2.1 and 4.3—the experiment with the microscope and the photo¬ 
graphic plate in the image or focal plane. 

If we seek to realize similar experiments for macrosystems we find that 
there are insurmountable difficulties in order to reach accuracy sufficiently 
high to prove the noncommensurability of the observables under considera¬ 
tion. Or, expressed differently, for two such noncommensurable decision 
observables only those approximation observables (which are not decision 
observables) are realizable (in the sense of IV, §4) which are all coexistent. 
Therefore there can be no claim that the “validity of quantum logic” has been 
experimentally proven for the case of macrosystems. 

The formulation of quantum mechanics based upon the structure given 
in II permits us to see more clearly what is common and what is different 
between microsystems and macrosystems. 

It is not known whether there is any one application of statistics in physics 
which cannot be carried out with the aid of the structure presented in II (see 
also [8], §§11,12). These structures can be applied regardless of whether we 
are dealing with micro- or macrosystems. The distinction between macro- and 
microsystems becomes evident only when we exceed the axioms introduced 
in II. We must therefore ask, in a critical way, whether structure axioms 
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have been introduced in III which are probably not applicable to macro¬ 
systems. 

Some relevant structures have already been discussed in III. There, for 
example, the set C, is defined differently for micro- and macrosystems (see 
III, §1). There Axiom AV 4s (in III, §3) is not suitable for the description of 
macrosystems. Macrosystems, on the contrary, can be described objectively 
in a state space Z, and in terms of a time dependent trajectory z(f) in Z. All 
known descriptions of macrosystems have this form, whether they are 
describing hydrodynamics, the Boltzmann transport equation or Brownian 
motion. We do not have space to outline or explain these cases. A presenta¬ 
tion of such a description can be found in [7] and in [35]; a simplified 
mathematical introduction can be found in [6], XV. 

We shall now attempt to clarify one aspect of this problem: The circum¬ 
stances under which it is possible to objectify the mode of description of 
the macrosystems which are used for the preparation and registration of 
microsystems. First, we note that the measurable attribute for a macrosystem, 
that is, its state zeZ at different times t is defined by pre-theories. For 
example, for hydrodynamics it is assumed that concepts such as position, 
time, density, flow velocity, temperature, etc. are already understood, and 
that we know, on the basis of pre-theories, how to measure these quantities. 
With respect to microsystems we are not (!) in a position to describe the 
so-called observables in terms of pre-theories, as we have already discussed 
in I. 

On the other hand, macrosystems are also produced for experiment, that is, 
they are prepared, and their trajectories are measured, that is, registered. 
Often such a description of the preparation and registration is ignored, in 
particular, when the behavior of the macrosystems is dynamically determined, 
that is, when the trajectories z(t) are determined by their value at a given time, 
for example, by z(0). If the use of statistics is required, then macrosystems 
can be described in terms of preparation and registration procedures. We 
note, however, that it is always possible to describe dynamically determined 
systems in terms of statistics as a special case of the fundamental structures 
introduced in II. Herein lies the generality of the formulation of statistics 
introduced in II. Let M m , J m , 8i 0m , be the sets defined (by analogy with II) 
for the macrosystems under consideration. How do these sets relate to the 
state space description? 

A state space description means that for every system x 6 M m there is a 
corresponding trajectory z(t). For macrosystems z(f) is not, in general, 
defined for all values of t, particularly for arbitrary times t — oo before the 
macrosystem was produced. It is mathematically convenient to assume that 
z(t) is defined only for t > 0 where t — 0 is the time at which the macrosystem 
was prepared. Let Y denote the space of trajectories z(t), where we frequently 
impose certain continuity requirements upon the elements z(t) of Y (see 
[7], [35], [6], XV). A description of macrosystems in state space means that, 
in the theory of these systems, a map M m 4 Tis defined. 

A structure $ of objective properties is defined on Y as follows: Let E be 
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the Boolean ring of subsets of Y (for example, the set of Borel sets if Y has a 
uniform structure). A property structure on M m is defined by 

/= 

which, on the basis of the physical meaning of / is a structure of objective 
properties, that is, between S, £ m ,3t 0m ,3i m the relations given in III, D 4.1.2 
are satisfied. In this case £ is not only “virtual” (see III, D 4.1.1) because £ is 
distinguished by the physically interpreted structure M -4 Y and is therefore 
a set of real properties. 

In [7], [35] it is shown how to describe macroscopic systems more precisely 
in terms of trajectories in state space, that is, how it is possible to define in a 
mathematically more precise manner that a fee 3k m is a registration procedure 
for trajectories z(t ) in the state space. For us it is only important that the b e 01 m 
does not register the systems better than the p e £ and that the pe£ are, so 
to speak, idealizations of the procedures be 3k m . 

Therefore M m , St m , 3$ 0m , 3i m , £ represent an objective description of 
physical objects, namely, the macrosystem under consideration. What, 
however, can this objective description of macrosystems have to do with a 
quantum mechanics of macrosystems? What do we mean by a quantum 
mechanical description of macrosystems? 

By a quantum mechanical description of macrosystems we mean that a 
macrosystem is considered to be composed (according to VIII, §4) of very 
many atom nuclei and electrons. When we assert that this quantum mechanics 
of “many-particle systems” is not a closed theory (not a g.G. closed theory in 
the sense of [8]), but contains many unrealistic aspects, which must be 
resolved by an extension of the theory, thus we assert nothing other than that 
the structure introduced in VIII for composite systems is insufficient to 
describe reality. 

From VIII, §2 it follows that the description of composite systems intro¬ 
duced there requires a certain set of registration possibilities, which just give 
a physical meaning of the representation of the Hilbert space in the form of 
product spaces. Do the assumed registration possibilities exist in the case in 
which the system consists of very many elementary systems? Is it possible to 
measure, for example, the positions and momenta of all these electrons? If it 
were not possible then the description of composite systems in the form 
presented in VIII is only an approximation for the case in which the systems 
are composed of only a few elementary systems. The fact that this form is 
questionable for the case of relativistic systems has already been mentioned 
in VIII. It is important that the reader be aware of the limitations of this 
structure for the description of the composition of elementary systems 
because some physicists assume that this structure is universal, many obscure 
conclusions are drawn from this generality. Both macrosystems and 
relativistic systems indicate that there are limitations of the structure for 
composite systems described in VIII, a result which is already evident in the 
physical basis of this structure given in VIII. 





2 Macroscopic Systems 361 


There is another very striking argument that the structure described in 
VIII cannot be universal. The method of product spaces can be extended to 
arbitrary many systems. But there are not arbitrary many systems in the 
world. There must be a limit of the validity of this structure. 

Nevertheless we may formulate quantum mechanics for “many” particles 
using VIII, that is, for such systems which can also be described in terms of 
state spaces Z. If we restrict quantum theory to such systems types of many 
particles we will denote the theory by 3k^~ qew , where the index exp shall 
indicate that we are considering the formal extrapolation of quantum 
mechanics to macrosystems, even though we know that 3k3T qf .^ has unrealistic 
consequences. Although l?&~ qtxp is a restriction of the total quantum 
mechanics we will, for simplicity, use the same letters M, J, 3i 0 , 3k as in II. 

We now have two theories for the same domain of facts, one given by 
3k2T m with M m , J m , 3i 0m , 3k m , Z, Y and S’, and the other .^,^ exp with M, SL, 
3k 0 , 3k. We conjecture that not all of the elements of J, 3k 3k are realizable, 
but are in part only invented preparation and registration procedures, 
invented in order to maintain the structures presented in III-VIII for 
Therefore it is reasonable that the relation between 3k3T q exp and 3k!T m 
can be described in the following way: There exists a bijective map M m M 
for which iJ m c J, i3t 0m c 3t 0 , i3k m c 3k, and the representation of the 
Galileo group in 3P3T m and its representation in 3k.3T qf:w are compatible. We 
shall only examine the last condition for the physically important time 
translation. 

The meaning of the time translation T z as a displacement of the registration 
relative to a preparation in 3k. has been described in VII, §1. Here we only 
need the fact that b -* T z b is a map of 3k into 3k. 

In SkST m a time translation operator T t which maps 3k m into 3k m can be 
defined using the trajectories z(t ) e Y and has the same physical meaning as 
T z in SP3r qe%v : For this purpose we define for x > 0 

T t z(t) = z(t + t). 

T t b is then the registration which responds to z(t) in the same manner as 
does b to z'(t) = z(t + r). In particular, for peg: 

T t p = {z(r) | z’(t) = z(t + x )ep}. 

For macrosystems T z is meaningfully defined only for x > 0, that is, for the 
semigroup of positive time translations, since z{t) is defined only for t > 0. 
(Here we suggest that the reader examine the combination problem (see 
III, §1) for the case of macrosystems for negative time displacements!) 

For the injection i (as mappings of 3l m in J and 3k m in 3k) we require that 

4(a n b 0 , a n b) = A((ia) n (ib 0 ), ( ia ) n (ib)), 

A((j'a) n (iT z b 0 ), (ia) n ( iT x b )) = A((ia) n (T z ib 0 ), (ia) n (T z ib)), 

where X m is the probability function in 3k3T m . 
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The mapping T r in 2k p is determined by the operator H given in VIII 
(5.8). Then, for given i, the behavior of the trajectories z(t) is determined by 
(2.1) and the Hamiltonian operator. This is precisley the essence of what is 
called “statistical mechanics”. 

The above requirements (2.1) for the embedding i of the theory 2k2T m into 
2k3~ q exp represent a research program. Here we may only present a few aspects 
of this program, for a more complete description, see [7] and [35]. We shall 
only briefly mention a few important aspects. 

According to the meaning of 2P2T m there exists, as defined by the pre¬ 
theories, a measurement of the state z “at time” t. In such a measure¬ 

ment “at time” t is not defined, a fact we have shown at the beginning of VII, 
and again in XVII, and we shall again consider in §4. Therefore a meaningful 
connection between 2k3~ m and 2k2T qf , xv can be made only with the help of T t , 
and not with the help of the concept of measurements “at time” t. 

It follows that (2.1) cannot be exactly satisfied for all x > 0. Physically 
it is sufficient if (2.1) is satisfied (providing that t is not too large), to such an 
approximation that the differences between the left- and right-hand sides of 
(2.1) cannot be tested because it is impossible to perform sufficiently many 
experiments. 

The existence of such a map i is called compatibility condition of the 
objective mode of description of macrosystems with ,S^ cxp . The opinion 
proposed here is that the subsets i,2 m and i9k m describe the “physically 
possible” preparation and registration procedures. 

A general theory which determines the map i has not yet been found. In a 
few special cases (e.g. the Boltzmann distribution function) the map i can be 
constructed. In practice physicists handle this generally unsolved problem 
by “guessing” some of the observables which are in the range of i (some of 
the so-called macro-observables) with the aid of the correspondence 
principle, physical intuition and luck. 

For some it is unacceptable to abandon quantum mechanics as the most 
comprehensive theory. Proponents of this viewpoint will often seek, within 
the context of 2k2T q ^ p , to deduce the structures of &9~ m . The first step of such 
a deduction would be the derivation of the sets il m , i2k 0m , i2k m alone with the 
aid of the structure of Hilbert spaces described in VIII and with the aid of 
the Hamiltonian operator. Different such attempts have taken place, using 
the aid of limiting processes in the sense of increasing “particle number.” But 
the structures present in &2T qexp are too weak for such a deduction. We may, 
for example, derive electrostatics from electrodynamics, and obtain a theory 
which is valid for a restricted domain. Here, however, we require more—why 
are only such preparation and registration procedures from il m and i2k m 
realizable, and not those from all of 21 and 2k ? If we take this physical inter¬ 
pretation of i2l. m and i2k m seriously then #^ exp is not a closed theory. By 
replacing 2L, 2t 0 , 01 by the subsets i2t m , i2k 0m , i2k m then .3A^ exp no longer has 
the form of an axiomatic basis (see I). In particular, the equivalence class 
decomposition of iJ m , etc, is totally changed, that is, we obtain new ensembles 
and effect sets, from which we obtain a total change of the structures set 
down in III for 

y exp 
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Although the compatibility of 3P3T m with ^^ exp could not be proven in 
general, the present studies of this problem show that this compatibility 
appears to hold with almost certainty (see [7]). 


3 Compatibility of the Measurement Process with 
Preparation and Registration Procedures 

In XVII we have seen that the quantum mechanical description of some of 
the parts of the measurement process is compatible with the viewpoint 
presented in II. On the contrary, we cannot, using the methods described in 
XVII, explain how we may establish measurement results as objective entities. 
Quantum mechanics by itself is not sufficient. Assuming the compatibility 
of #.^ exp and then, in principle, it is possible to prove that a physical 
(but not a purely quantum mechanical) description is compatible with the 
viewpoint presented in II. 

Here we shall not provide a detailed analysis of the individual steps (see, 
for example, [36], [7] and [37]). It suffices, on the contrary, to make clear 
that the structure in II can also be deduced from the structures of SP,T m . We 
shall not describe these deductions mathematically here; these deductions 
can be found in [6], XVI or (somewhat more difficult, but with more detail) 
in [7], 

Here it is sufficient to outline the deductions as follows: The structure of 
experiments with microsystems outlined in I and described mathematically 
in II is developed from preparation and registration apparatuses, where the 
latter are macrosystems. Preparation and registration apparatuses combined 
to an actual experiment also represent a macrosystem which is composed out 
of two macrosystems. Thus, to experiment with microsystems we are 
immediately confronted only with macrosystems and their trajectories. 
The single characteristic of the experiment is that the apparatus is composed 
of two parts, and that the first part (the preparation apparatus), influences the 
behavior of the second part (the registration apparatus) but not vice versa. 
The directedness of this interaction from the preparation part to the registra¬ 
tion part may be purely defined with the help of macroscopic description of 
trajectories (see [6], XVI, §1.3 and [7]). 

Thus experiments with microsystems can be described entirely in the 
macroscopic domain without the “use of microsystems.” 

If we now consider the structure of this directed interaction we must 
restrict our considerations (as in the case of any physical theory) to a certain 
domain of application, that is, to the fundamental domain. Its boundaries 
can be gradually ascertained by introducing additional axioms (called 
normative axioms). These axioms can be introduced in such a manner that 
we may deduce the structure in II and III, where the set M is, at first, nothing 
other than the set of individual interaction processes (see [7] and [6], XVI). 
In this way the microsystems appear to be physically real entities which carry 
the interaction from the preparation part to the registration part. The 
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historical discovery of microsystems can therefore be duplicated in a mathe¬ 
matical form within the context of a theory of macrosystems! 

The theory can be applied to the combined preparation and 

registration apparatuses as a macrosystem. If the compatibility discussed 
in §2 between ^^ exp and is valid then it also applies between the 
macroscopic description of experiments with microsystems, that is, between 
the structure introduced in II and III and the theory .5^.^ exp . Nevertheless 
there exists an essential difference between the physical description of the 
subprocesses of preparation and registration in XVII and the description 
of the entire processes of preparation and registration in 8P!T m , where SPST m 
and SP3 ~ qeyip are compatible. This difference arises from the fact that for 
macrosystems we regard rather than ,^.^ exp as the more comprehensive 
theory, and therefore we do not regard all aspects of .^.^ exp as real. For 
example, the registration process is closed, providing that the trajectories on 
the registration apparatus have taken place despite the question whether 
these trajectories are ascertained by an “observer” or not? The formal 
possibilities in the theory #<^ exp to measure other observables than those 
which correspond to the macroscopic observables associated with the 
apparatus do not possess any reality. These formal properties belong to the 
unrealistic part of ^3T qexi ,. The microscopic observable that is measured by the 
registration apparatus is solely determined by this apparatus (and not by 
observers) (for a more precise mathematical formulation of this result see 

[ 7 ])- 

We may now illustrate this result using the example discussed in XVII— 
the example of the microscope. After the scattering of the photon and the 
passage of the photon through the lens, we may still determine whether the 
photographic plate lies either in the image or focal plane. If, for example, the 
photographic plate lies in the image plane, then the observable is fixed; in 
this case, as an approximation observable for the position of the electron. 

A bizarre example of the problem of “fixing” the registration result has 
been given by Schrodinger—Schrodinger’s cat [38]: 

We may construct a bizare case. A cat is imprisoned in a steel chamber, together 
with the following infernal machine (which is out of reach of the cat): in a geiger 
counter there is a small amount of radioactive substance, so small, that in the 
duration of a few hours as few as a single atom decays with equal probability for 
none. If the counter responds, a relay is activated, and a hammer breaks a vial of 
cyanide. If the system is observed in an hour, we will say that the cat is living if no 
atom has decayed. At the first atom decay the cat will be poisoned. The function 
of the entire system is then expressed in the form that the cat is in a mixed state of 
being dead and alive. 

In our description of quantum mechanics the result is objective, and does 
not depend upon whether someone observes the system or not. There are no 
“measurement possibilities” in the macroscopic domain other than the 
determination of objective results, whether the cat is dead or alive. The 
objective completion of the registration is already determined by the action 
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or inaction of the hammer, that is, the effect of the microsystem is made 
evident in the macrosystem. The objective completion of the registration 
exists as soon as the action of the microsystem has reached the macroscopic 
domain. Here the objection is sometimes raised that such a description is 
inconsistent because it would be necessary to introduce a “discontinuity,” 
at a suitable “size” of the system at which the quantum mechanical description 
is suddenly transformed into the macroscopic, objective one. We shall return 
to this objection in §5. 

At the end of a quantum mechanical measurement process we do not need 
to consider such notions as the consciousness of an observer nor “my 
consciousness” or that of “a friend” (see such discussions in [34]). The 
“completion” exists, in our description, in the macroscopic realm, and it 
does not matter who notices the result nor in what form the result is ascer¬ 
tained and “fixed” (for example, in a computer). 


4 “Point in Time” of Measurement in Quantum Mechanics? 

In VII and in XVII we have shown that, with respect to the registration of 
microsystems, it is not physically meaningful to attribute a “point in time” 
at which the measurement occurs. In VII we could not exclude the possibility 
of introducing additional structure which would permit us to speak of making 
a measurement at a particular instant of time. In XVII we saw that quantum 
mechanics makes such a possibility doubtful because measurement scattering 
processes, like any scattering processes, last a finite time. However, we could 
imagine that we speak of idealizations if we speak of the instant of a measure¬ 
ment, that is, of the idealization that the scattering processes used are “very 
short.” The duration of such measurement scattering processes is not so 
much determined by the interaction of the systems 1 and 2 rather than by the 
ensembles Ifj and W 2 : The duration is the time where the W t (see XVI, §§1 
and 2) perceptibly differ from W\ and W{. 

Even if this duration is very short, the measurement scattering morphism 
T(l, 2; W 2 ) (see XVII (2.2.7)) does not directly depend on the “point in time” 
of the scattering, since S does not depend on the time where W, differs from 
W\ and W{ (see XVI, §4.4). T(l, 2; W 2 ) depends only indirectly from this 
“time” by the ensemble W 2 (see also the gedanken experiment in XVII, §2.1). 

Therefore a definition of a point in time where the measurement occurs is, 
in general, impossible and very artificial for a few cases. We will now show 
that such a definition is also not necessary for the interpretation of quantum 
mechanics. 

But what is then the meaning of the time parameter for a time dependent 
decision observable A(t)? In many textbooks we can read that t is the point 
of time at which the observable is measured. Is it therefore necessary to 
restrict the measurement of time dependent observables A(t) to measurement 
processes with “very short” measurement scatterings? Many textbooks seem 
to answer this question in the affirmative. Nevertheless this restriction to very 
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short measurements is not necessary. Only in the case of approximation of 
external fields in VIII, §6 it was necessary to presuppose the existence of 
“short” measurement processes, but short only compared with the variations 
of the fields. 

The opinion that the time parameter in A(t ) denotes the time where the 
observable is measured results from an uncritical transfer from classical 
mechanics to quantum mechanics. 

For macrosystems we have presupposed (see §2) that it is clear from pre¬ 
theories what it means to measure the state that a system has at a given time t. 
We say that the pre-theories define the “direct” measurements of the states 
at various times t. We presuppose, for instance, for a system of mass points 
that pre-theories tell us what is meant by position and velocity of the mass 
points at a time t. Such a pre-theory may be, for example, geometric optics. 
These classical concepts cannot be transferred to quantum mechanics since 
the pre-theories for quantum mechanics do not tell us, what is meant by the 
position and momentum observables Q(t) and P(t). The pre-theories for 
quantum mechanics define only the preparation and registration procedures 
and especially the trajectories of the macroscopic preparation and registra¬ 
tion apparatuses, that is, the states which the apparatuses have at various 
times t. 

The direct measurements of the states of a classical system at a time t are 
already defined without any use of the dynamics of the classical system, for 
example, of the dynamics of the system of mass points. But also for classical 
systems we may use the dynamics to enlarge the possibilities for measurement 
by “indirect” means (for the concept of indirect measurements see [8], §10). 
For the system of mass points we may directly measure r®(fi) and r^Q). 
Then from the dynamics we obtain the values r ii) (t 2 ), r ii \t 2 ) for another time 
t 2 , that is, also for a time t 2 where direct measurement will no longer be 
possible. For example, we may indirectly measure the positions of the planets 
for many years B.C. 

In quantum mechanics all observables can only be indirectly measured on 
the basis of the directly measured trajectories of the apparatuses. Therefore 
it is impossible to define a time dependent observable A(t) in the same way 
as in classical mechanics, that is, by a direct measurement of A at a time t. 
To see this more clearly, we reflect on the following two time dependent 
observables A(t) and B(t) where B(t) = A(t x + t) for a fixed Q. Both A(t) 
and B(t) satisfy the relations 

A(t) = e iH, A(0)e~ iHt , 
f B(t ) = e iH, B(0)e~ iHt . 


For which of these two observables ,4(0 or B(t) is t the so-called time of 
measurement? It is just not possible to distinguish A(t) and B(t) by the 
(nonexistent) criterion that, for example, the time t in A(t) is the “point in 
time” for the measurement of A(t) while the point in time of the measurement 
of B(t) is Q + t. 
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Is there no possibility at all of distinguishing between ,4(f) and B(t) ? There 
is another possibility: The behavior of the observables under Galileo trans¬ 
formations. This behavior is a good intrinsic possibility in quantum mechanics 
which we have utilized in the definition of Q(t ) in VII, §4. In general A(0) and 
B(0) do not have the same behavior under Galileo transformations. 

The fact that there is no possibility in SP.T q ^ v to define the values of the 
observable “at a time t” does not contradict the compatibility between 3P.T m 
and 2PAr qaLV . The macro-observables which correspond to the trajectories are 
not defined in SPAT q< , xv alone but are defined in 3P-T qt . xx> by means of the 
imbedding map i. The macro-observables for the states of the macrosystems 
at time t are defined in this way. 


5 Relationships Between Different Theories and 
Quantum Mechanics 

Since we do not consider quantum mechanics to be a universal theory, it is 
important to survey the relationships between the dilferent theories and their 
domains of validity (fundamental domains—see I). We shall use the general 
analysis from [8], §8 (see also [6], III, §7). Familiarity with the analysis 
presented there is not necessary for an understanding of this section. Our 
discussion will be based upon the following diagram, which we shall provide 
a detailed explanation later: 



Here we do not claim completeness. We only intend to give an example of the 
network associated with physical theories. The following will also serve to 
help to correctly describe the place of quantum mechanics 3PAT q in physics. 
Here &,T q will denote the theory which we have presented in II-XVII. In the 
diagram there are also new theories which we have not described which, 
except for knowledge of some of their general structures, are at present only 
partially understood. In the diagram these are denoted by SPAT., and 3P!T m . 

In §2 we have briefly described the meaning of the theories AP.% and SP.T m \ 
APAT., is the desired most comprehensive theory of macrosystems, which 
includes the mutual interactions of macrosystems. The expression “macro” 
characterizes the domain of application of the theory. Here we must be 
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careful not to include cosmology, because such an inclusion would introduce 
totally new problems with respect to preparation and registration (see §6). 
We shall restrict 0.T., to a domain, which we shall call the laboratory domain. 
Then in the above diagram will be the most comprehensive theory. It 
may, however (at least among the theories presented here), not apply to high 
energy interactions in order that it not include elementary particle theory. 

We may already imagine some of the structure of 0T m . It should describe 
macrosystems in terms of trajectories in a state space, as described in §2. 
3PT m would then be a restriction of the more comprehensive theory 0.0. 
The arrow in the diagram from 0^ to 00~ m represents this restriction. 

Many specializations, that is, restrictions of the partially understood theory 
02T m are familiar to physicists. We shall mention only a few of these restrictions 
00~k, SP&'hg, 00 ~b- Here 00~ h , 00~ B and 00~ ha are the theories of hydro¬ 
dynamics as described by the Navier-Stokes equations, the Boltzmann trans¬ 
port equation, and the theory of hydrodynamics as applied to the special 
case of gases. The arrows again illustrate the restrictions, for example, 
indicating that 00~ hg may be obtained from 0.0~ B by restriction. 

Of particular interest here is the part of the above diagram which lies to 
the right of the vertical line from 00-, to 0ST m . 

00~ mv) may be the restriction of the theory 00~ m which is appropriate for 
the description of directed interaction processes between two macrosystems 
(see §3). The theory 09~ q can be obtained by a restriction of the theory 00~ mw . 
The restriction consists of those directed interactions for which the axioms 
from III-XVII are usable. In brief, we say that 00~ q describes the interaction 
by means of “microsystems.” 00 qn may be the theory which is obtained 
from 09~ q if we begin by using the sets K of ensembles and the sets Lof effects 
as fundamental sets and “forget” that K and L are obtained from preparation 
and registration procedures. 00~ qn is therefore the “normal” version of 
quantum mechanics. It is customary to formulate this “normal” version by 
beginning with the sets d e K and d e L of extreme points of K and L where the 
elements of d e K are represented by the normed vectors of a Hilbert space 
0C and are called states. The elements of d e Lare the projection operators and 
are frequently called yes-no observables or propositions. 

Even though we are only certain that 00~ q is a theory of microsystems, 
using the structure presented in VIII we often extrapolate 00~ q to “many- 
particle systems,” that is, to macrosystems. This tendency of physicists to 
extrapolate the fundamental domain of a theory as far as possible, even when 
such an extension may be questionable, is completely legitimate in the 
evolution of physics. Only by such investigation is it possible to discover the 
limitations of a theory. If we extend the fundamental domain of 03~ q in this 
manner to systems composed of arbitrarily many microsystems, then by 
restriction of this theory we may obtain the theory 00~ qeiLV which is only 
applicable to many-particle systems. The dashed line in the above diagram 
refers to such a restriction, in order to indicate that the validity of all the 
axioms from III and XVIII for many particle systems (that is, macrosystems) 
is highly questionable. 
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The wavy lines in the above diagram indicate embeddings. In §2 we have 
described the embedding of 3P!T m into p with the aid of the embedding 
map i. The arrow from SPZT; to means that the embedding of in 

3P2T qaLp is perhaps only a special case of the more general embedding of 5P3~-, 
into @P£T q exp • For the discussion here it is important to note that we consider 
to be a more comprehensive theory than ^^~ qexp and that we believe 
that the only part of SPT qexp which reflects reality is the part which is the 
“image” of 0>2T m under the embedding. 

Why then do we actually consider the embedding of and into 
#^ exp ? That is why the theories and 3P3T m are not completely known. 
For the case of 3P2T m only certain partial structures are known, which can, 
with the aid of the map i, be completed by adding additional structures. The 
known Hamiltonian operator in 3PT qexp (although we have been very critical 
of its introduction in VIII) makes it possible to make precise statements 
concerning the dynamics of trajectories in &9~ m . Thus, we can, for example, 
obtain the structure of SP3T B from the partially known structure of 3P2T m and the 
embedding i, if we restrict our considerations in 0>ST m and in ^.i^ exp to 
“attenuated” gases. In [6], XV, §10.2 we have outlined how it is possible (in 
the above sense) to derive 3P2T B (and the interaction cross section used in 
in the approximation that 8P3~ qexp is approximated by a classical 
picture. A more precise description of such a derivation of can be found 
in [39]. It is in this sense the wavy arrow from to ^^ exp is to be under¬ 
stood to be a specialization of the embedding of 2PT m in 3P ^~ qexp . 

Similarly the wavy arrow from to #.f qeip is a specialized case of the 
embedding of into #.Vj, exp . This embedding of into 2P-T qexp 

guarantees the compatibility of quantum mechanics with the introduction 
of the preparation and registration structures presented in II, as we have 
shown in §3. 

A realistic development of including its embedding in 2P.T q<:xp would 
supply a solution to the following problem which was frequently mentioned 
in §3—how the theory SP.T q must change as we increase the number of particles 
in a composite system in order to obtain the theory in the case of “many” 
particles. 

Here we do not use the term “change” in the mathematical sense of con¬ 
tinuity. The axioms of II and III cannot be changed in a continuous manner. 
This is, however, not a new phenomena in physics. The comparison between a 
more comprehensive to a less comprehensive theory cannot always be 
carried out in terms of continuous mathematical processes. Thus, for example, 
the Newtonian space time theory and the Einstein space time theory of 
special relativity are both “correct,” that is, are both useful theories; Einstein’s 
theory is more comprehensive than Newton’s, and both appear to mathe¬ 
matically contradict the other (see, for example, [6], IX, §8). 

If in we consider interactions which are transmitted by physical 
systems (as carriers of the interaction—see II, §4.4) where the number of 
elementary systems comprising these carriers become larger and larger, then 
we must assume that the assumptions made in II, III, VIII become “less and 
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less realistic.” In 8P3~ mv) this phenomenon must be present because 3P-T mvj must 
not describe any unrealistic interactions. Thus, for example, the embedding i 
of into #J^ exp specialized to the embedding of in 3P3T qexp must 
show that for larger molecules (as carriers of the interaction between the 
preparation and registration parts) the axioms set down in II, III, VIII can 
no longer be valid, that is, that there must be a more comprehensive theory 
of interaction carriers than 3P.T q . Here, for systems consisting of few 
“particles” ??-T q and SP-T K should be in close agreement. For interaction 
carriers consisting of large numbers of elementary systems should be 
consistent with 3PST m . 

When we assert that there must be a more comprehensive theory SPPT, 
(which we may obtain from providing that is already known) we 
should also expect to find noticeable differences in the case of systems com¬ 
posed of some elementary systems. Have such differences already been 
observed? 

They have not, because the interest has been in making comparisons 
between actual experiments and the theory SP9~ q , and the question as to what 
is possible in experiments has not, in general, been made. For the most part 
the consequences of the structure of the Hamiltonian operator have been 
tested rather than the experimental consequences of the axioms in III. 

We now seek to determine the fundamental domain of 3PST r For elemen¬ 
tary systems, including coupling with external fields (in the sense of VIII, 
§6) 2P?T q appears to describe the possibilities of preparation and registration 
very well. The description of the possibilities to prepare composite systems in 
“bound” states is excellent providing that the eigenvalues of H are not “too 
close”—where by “too close” we mean that the transition frequency is of the 
order of days or years. Thus it is questionable whether, for large molecules 
(which have both right- and left-handed forms) the eigenstates of H can be 
prepared or whether H can be measured as an observable since the right- and 
left-handed forms are not eigenstates of H. The eigenstates of H are, on the 
contrary, superpositions of right- and left-handed forms with transition 
frequencies which, for large molecules, are very small, with the consequence 
that there are no transitions between right- and left-handed forms. 

It is questionable whether all ensembles for unbounded states, that is, for 
scattering states can be prepared. We are hard pressed if we inquire about the 
experimental possibility to prepare ensembles W' which are obtained from 
X (4.15) from W where W is an ensemble prepared in the usual way from 
known scattering experiments (see the remarks in X, §4). It seems impossible 
to prepare W' = CWC ~ 1 if IF is an unstable state (XVII, §6.5). 

We deceive if we believe that although it may be extremely difficult to 
experimentally realize many preparations and registrations that they may yet 
be possible “in principle.” Can they in fact be realizable “in principle” given 
sufficient financial resources? It is precisely here that the more compre¬ 
hensive theory SP,Tw makes actual physical assertions about the possibilities 
of realizing such preparations and registrations. 

It is precisely here that the embedding of SP-T m in .^^ cxp , that is, the 
specification of the embedding map i together with the Hamiltonian operator 
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H from are sufficient to enable us to determine SP^. If the domain 

of realistic preparation and registration procedures are known for macro¬ 
systems, then they can be found for other not-so-large systems. A jump from 
SP2T q to or 2P2T-, will therefore not exist. On the contrary there must be a 
theory 3P3~ V which is more comprehensive than 2P3~ q which describes the 
preparation and registration possibilities more realistically than does 8P2T q , 
for systems composed of “fewer particles” SP3~ W and 2P.T q will ((approximately) 
make the same assertions about these possibilities. 

From this conception of SP^ let us return to the considerations of measure¬ 
ment scattering in XVII, §2. Then it will be clear that the following description 
is unsatisfactory: In a sequence of measurement scattering processes in 
which every step is described by 3P3~ q , the final step of a scattering on a 
macrosystem is a sudden jump which fixes the measurement results. On the 
contrary, the registration possibilities become more restricted the larger the 
systems which are used to scatter the microsystems. Thus, in the case of 
measurement scattering experiments with large molecules measurement 
results can be fixed in the molecules themselves, even though additional 
physical processes may be required to make the fixed result accessible to 
us. The “development” of a silver bromide grain sensitized by a photon 
resulting in a silver grain and its observation by a microscope is an example 
of such a physical process which makes the objective property “sensitized” 
accessible to us. 

With the consideration of the fundamental domain of SP-T q and the dis¬ 
cussion of the possibilities for a more comprehensive theory SPST^,, we have 
achieved what a physicist would wish for the formulation of a theory and the 
determination of its bounds. In II to XVII we have developed SP3T q and 
illustrated it by means of examples. Here in XVIII we have shown the limits 
of application of 2PST r The fact that sharp boundaries cannot be drawn is 
typical for any SPIT. The similarities of a mathematical picture Jt2T in a 
SP.T with reality are not exact in the fundamental domain, and become less 
and less valid as they leave the domain. 

We have now concluded our description of 3PS~ q . Since some physicists 
have used quantum mechanics to justify certain ideas, we will now pose the 
question as whether these ideas can be justified by means of the theory 
gP&~ q developed here. 

6 Quantum Mechanics and Cosmology 

In cosmology we seek to obtain general statements about the universe as a 
whole. Here we find typical problems concerning the physical reality of 
statements about the universe (see, for example, [8], X, §10). In this section 
we shall only be concerned with the relationship between quantum mechanics 
and cosmology. 

In §2 we have given physical arguments for the fact that SPST ^ p cannot be 
the most comprehensive theory for macrosystems. An application of quantum 
mechanics to the universe as a whole is clearly not possible in the context of 


Pidie. 



372 XVIII Quantum Mechanics, Macrophysics and Physical World Views 


the formulation of quantum mechanics presented in II-XVII. In this formula¬ 
tion the notion of statistics presented in II plays a central role. Statistics is a 
theory concerned with frequencies of occurrence associated with selection 
procedures. Such frequencies of events can only occur in the universe. The 
universe is, as a whole, unitary. Therefore it is not meaningful to make 
statements of frequency about the universe itself because, in principle, there 
are not different universes, but only different parts of the same universe. 
Certainly we could imagine universes different from the one in which we live. 
Such a universe could not have any physical interaction with our universe 
otherwise both would be parts of a more comprehensive universe. Those 
who disagree with the analysis presented in §2 which shows that J-T q( . xp can 
only approximately describe macrosystems must seek elsewhere for the limits 
of applicability of ^T qexp because the universe, as a whole, cannot belong to 
the domain of application of .f.^, exp . 

Certainly there are attempts which seek to apply quantum mechanics to 
the entire universe [34]. What can we make of these attempts? If we examine 
these attempts we find that the meaning of quantum mechanics is different 
than the theory denoted by SP.T q and developed in II-XVII. Expressions like 
“state,” “probability,” “possibility,” etc. will have a different meaning than 
that used in the theory presented here. Expressions such as “state,” 
“probability,” “possibility” are associated with certain philosophical ideas 
which cannot be subjected to physical verification. We do not wish to 
prohibit the practice of adding philosophical ideas to a physical theory, but 
think that it is reasonable to do so after a theory has a physical interpretation. 
We know that many physicists think that every interpretation presupposes 
philosophical ideas. One of the reasons for writing this book was to show that 
it is possible to interpret quantum mechanics without such philosophical 
presuppositions, that is, to interpret it only on the basis of pre-theories. 

If, for example, a vector from a Hilbert space JC is said to represent the 
“state of the universe,” it would not make sense physically because no one 
knows what the “state of the universe” should mean as a physical concept. 
If by state we man what is defined in III, D 1.1, then the universe “has” no 
state ij/ because there cannot be an ensemble of universes. The preparation 
procedures in T.T q( . Kp will, in the application of ^T qexp to the universe, lead 
to absurdity because there is no single preparation procedure for universes, 
but there is only a single universe. 

On the other hand, no one is forbidden to propose a mathematical theory 
.M.T and to add “philosophical” ideas to such a theory. Since an TIT cannot 
in itself have a physical interpretation, it is necessary therefore to supply JIT 
by the introduction of mapping principles (JisrfT) (see I) in order to obtain a 
2PT. If we use the mathematical theory of Hilbert spaces for JIT in an attempt 
to apply it to cosmology, then it is necessary to provide a different interpreta¬ 
tion than that provided here for quantum mechanics in II and III. Physically 
what is important is not the species of structure upon which JtT is formally 
constructed, but what physical interpretation is attributed to the elements 
of JIT (see I and [8], §5). Only if JIT is an axiomatic basis can we easily 
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determine what structure aspects actually represent reality. The formal 
species of structure of Hilbert space cannot be obtained from an axiomatic 
basis (as obtained in II and III) if we seek to describe cosmology as a 2P2T. 
If we examine attempts to formulate a “quantum theory of the universe” then 
we shall readily see that philosophical grounds lead us to introduce such 
notions as “state of the universe” as a concept already understood before the 
theory, that is, a priori. Then it is possible to arbitrarily argue without 
coming to a conclusion whether it is meaningful to formulate a “quantum 
mechanics of the universe.” The exponents of such a theory will take the 
a priori ideas (which cannot be defined on the basis of pre-theories) as the 
sole approach to understanding physics. Others, such as the author of this 
book cannot attribute any physical meaning to such ideas and consider them 
to be meaningless word games. It is possible only to agree on the portion of 
the interpretation of a physical theory, the portion which was denoted by 
JistfgP in I (see also [8] and [6], III), namely, the portion which connects 
experiments with the mathematical picture JOT. 

For the case of a “physical” cosmology the problem of mapping principles 
JtstfSP is in no way simpler than it was in the case of quantum mechanics. 
Cosmological statements about time-space structure cannot be described in 
terms of pre-theories based upon experiment. On the contrary, pre-theories 
only describe phenomena which can be described “in the laboratory,” 
which, in turn, is influenced by the universe. For microsystems such experi¬ 
ments in a laboratory are carried out in a “shielded” environment; for a 
theory of the universe experiments are required which are essentially in¬ 
fluenced by the universe. In the case of quantum mechanics we seek to draw 
conclusions about the structure of the much smaller microsystems from 
processes involving “macrosystems” in the laboratory domain. In cosmology 
we seek to draw conclusions about the structure of the “much larger” 
universe from processes in the laboratory domain. The problem of mapping 
principles for cosmology is an interesting one, but one which, for space 
limitations, cannot be discussed further in this volume. 


7 Quantum Mechanics and Physical World Views 

It is often said that special relativity, general relativity and quantum 
mechanics have lead to “revolutions” in the world view of physics. 
Particularly, in connection with quantum mechanics there have been the 
most extensive different philosophies introduced. We refer readers who are 
interested in the various philosophical discussions about quantum mechanics 
to [40] in which there are thorough critical examinations of the various 
philosophical points of view and extensive bibliographies. 

The fact that quantum mechanics has been used as the basis for the most 
varied philosophical ideas clearly shows that not the physical theory noted 
quantum mechanics has philosophical implications but the reverse—that 
fundamental philosophical considerations have lead to different ideas about 
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quantum mechanics. Of course, the formulation of quantum mechanics 
presented here rests upon a fundamental physical outlook which is described 
in detail in I and in [8] and will now briefly be discussed. 

If we consider quantum mechanics to be a theory in the sense described 
in this book, then we cannot speak of a revolution in the physical world view. 
Instead, we will find that quantum mechanics has led to a critical reexamina¬ 
tion of what physics is about. In this way quantum mechanics has made it 
necessary to abandon preconceived ideas which were believed to be essential 
for physics. Such ideas were: physics as the analysis of causal connection, 
physics as an explanation of the past, physics as a way to predict the deter¬ 
ministic course of the world. What remains is physics as a method to discover 
the structures of reality, and to use these structures, a method solely based on 
objective facts and on the well-defined language of craftsmen. This view is 
described in detail in [8] and [6], III, and was briefly outlined in I. 

If we consider a physical theory as a method to discover the structures of 
reality, and representing these structures by the mathematical structures in 
JUT (see I or, for more detail, see [8] or [6], III), then there will be no 
“revolutions” in physics. For example, Newtonian physics and gravitational 
theory is as useful now as before despite the introduction of general relativity 
(see [6], IX and X). The astronauts travel between the earth and the moon 
according to Newton’s laws. 

The fact that in this analysis of the structure of reality it has been necessary 
to introduce new structures for the case of the universe as well as for the 
microworld which surpasses that required for the description of processes in 
the laboratory could only be puzzling to those who believe (on some philo¬ 
sophical ground) in the homogeneity of the structure of nature relative to 
scaling in size. 

One particular effect of quantum mechanics is particularly important: 
the need for sober judgment in physics and to reduce physical theory to what 
is real. Thus quantum mechanics forms a catalyst for a new and fruitful 
reflection about problems in the philosophy of science. Thus the discussions 
about quantum mechanics presented here are leading towards an examina¬ 
tion of the “fundamantal structure of a physical theory” as presented in [8], 

Thus the reduction of prejudices about physics resulting from a reexamina¬ 
tion of quantum mechanics make it possible to discuss other aspects of reality 
in a more meaningful way than before, for example, the problem of 
“consciousness” and the problem of free will. In [6], XVII these problems are 
briefly discussed. 





APPENDIX V 


Groups and Their Representations 


In the space available here it will not be possible to develop the theory of 
group representations in the generality used in VI-VIII. Here we shall only 
attempt to provide the most important aspects of representation theory 
which are necessary for practical applications in the theory of atomic and 
molecular structure. It is hoped that structures used in VI-VIII can be 
understood without the proofs. The proofs of the more general theorems can 
be found in the specialized literature [41]. 


1 Groups 

A set 0 is said to be a group if there is a structure defined on 0 in terms of a 
map 0 x 0 -> 0 (which for a, b, ce 0 we will denote by (a, b) -> ab = c ) 
which satisfies the following axioms: 

(1) ( ab)c = a(bc ); 

(2) there exists aneef such that ea = a for every a<=0\ 

(3) to each a e 0 there exists an element be0 such that ba = e. 

From (1) to (3) for x = ah it follows that fix = bab = eb — b. Since, according 
to (3) there exists a c such that cb = e it follows that cbx = c(bx) = cb — e 
and cbx = (cb)x = ex = x and we therefore obtain x — e, that is, ab — e. 
Thus it follows that ae — a(ba ) = (ab)a = ea = a. 

Suppose that there exists a b' for which b'a = e. Then it follows that 
b' = b'e — b'ab = eb = b, that is, the element b in (3) is uniquely determined 
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by a. We therefore write b = a" 1 . We obtain a ~ 1 a = aa~ x — e and 
(a" 1 )' 1 = a. 

Similarly it follows that the element e in (2) is uniquely defined since from 
e'a = a for all a e ^ it follows that e' = e'e = e. 

If, in addition to (1), (2), (3), the following condition 

(4) ab = ba 

is satisfied for all a, be<8 then f# is called an Abelian group. 

A subset cz ^ is said to be a subgroup if the group operations in 
satisfy the relation and if the relations (1), (2), (3) are satisfied. 

In order that 3fl? be a subgroup it is necessary and sufficient that a, be34? 
implies ab" 1 e34?. 

Proof. For a e G it follows that aa ~ 1 = ee ( S\ from e e y and a e ^ it follows that 

ea ~ 1 = a ~ 1 e^; from a, be^ it follows that ab~' e/S and therefore a(b ~ l )~ 1 = abetf. 

All powers a m (where a°= e, a~ n = (a~ 1 )") of the element a form a sub¬ 
group. A group for which all the elements are powers of a single element is 
said to be cyclic, and is therefore Abelian. Either a n # a m for m ^ n or there 
exists a k > 0 for which a k = e because from a" = a m it follows that a n ~ m — 
a m-i _ e if v is the smallest positive integer for which a v = e, then the 
elements a 0 = e, a 1 ,..., a v ~ 1 are all different and form the entire cyclic group 
because, for 0 < r < v it follows that a m = a qv+r = a? If 34? is a subgroup of 
this cyclic group and s > 0 is the smallest exponent with a s in 34? then for 
a m e34? it follows that a m = a qs+r = (a s ) q a r and therefore a r e34?; since 
0 < r < s it follows that r = 0. Therefore in 34? we find only elements of the 
form (a s ) 9 . Since a" = e in 34?, we must have n = p-s, that is s must be a divisor 
of n. For each divisor s of n the ( a s ) q form a subgroup. 


2 Cosets and Invariant Subgroups 

Let 34? be a subset of f# (not necessarily a subgroup). Let a34? denote the subset 
of which is obtained by multiplying the elements of M on the left by a. If 
3/C is a subgroup, then we say that aM 1 (where ae'S) is a left coset and 
is a right coset. If a e 34? then a 34? — 3/C and conversely since ae = ae 34?. In 
addition a34? = b34? if and only if a~ 1 be34? since if a~ l b e 34? then bJ4? = 
aa~ 1 bJ4? = a(a~ i b34?) = a^C\\fa3^ — b34?thena~~ x b34? = 34? from which it 
follows that a~ 1 be 34?. Different cosets have no common elements; if a34? and 
had a common element ac x = bc 2 then a' ^ = CjCj 1 e -^? and the cosets 
would be identical. Each a belongs to a left coset namely a34?. Each coset has 
the same cardinality since ac <-> be (for c e Jf) is a bijective map of aJf onto 
Jzfi. The entire group is the union of its cosets. If j is the number of different 
cosets (the “index” of 34?), N the cardinality of G and n the cardinality of 34?, 
then N = jn. 
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If the left cosets are also right cosets then atf = jF a because Wa is the 
right coset which contains a. Then we have Jf = a#£a~ \ and JF is called 
an invariant subgroup. 


3 Isomorphisms and Homomorphisms 

If f# and ( S' are two groups and if there exists a surjective map <&-+<$' such 
that ab —► a'b', then is said to be homomorphic to <&', and we write f# ~ . 

From ea — ait follows that e'a' = a' from which we conclude that e' is the 
identity in . From a~ l a = e it follows that (a l )'a' = e‘ from which we 
conclude that (a -1 )' = (a'Y 1 . If ^ is a (not necessarily surjective) map 
satisfying ab -> a'fe' then /(^) is a subgroup of and f# -4 /(^) is a homo¬ 
morphism. If the map ^ -> is bijective, then ^ is isomorphic to W which we 
denote by ^ s 

If ^ ~ and if ,/F is the subset of ^ which is mapped upon the unit 
element e' then JT is an invariant subgroup in ( S. 

Proof. If a, be Jf then ab~ l = e'(e') = e' from which we conclude that ab ~ 1 e Jf 
and Jf is a subgroup. If d is an arbitrary element of f then da -> d'e' = d', that is, 
d./V -> d'. Conversely, if x-> d' then d~ i x -> (d')~ l d' = e' from which we conclude 
that (/'‘reT and that x e <LF The residue classes dJf is precisely the set of elements 
which are mapped upon d'. In the same way we may show that jfd is the set of 
elements which are mapped upon d’. Therefore d.A r = ,/F d. 

The set of cosets of an invariant subgroup forms a group with the operation 
d v A r d 2 -J r because d v Jfd 2 Jt r = d i d 1 A r ,yV' = d i d 2 -A r . JfdJf = d,/\ r (there¬ 
fore Jf is the unit element), d ~ 1 JfdJf = ,/F This group is called the factor 
group &/Jf. From $ ~ it therefore follows that <&’ s ( Sj-Af where ,/F is 
the subgroup of ( S (kernel) which is mapped upon e'. On the other hand, 
ff ~ ^/,/F on the basis of the map a -> aJf. For a homomorphic map 
^ -> the diagram 


-> & 



9/jr 


is commutative. 


4 Isomorphism Theorem 

Let ~ (?. Then "f s W/Jf where Jf is the set of elements of ^ which are 
mapped upon e, the unit element of ( S. Let JF be a subgroup of which is 
mapped by the homomorphism (of f) onto #) onto .tF -if is then clearly a 
subgroup of < S. Let JF denote the set of all elements of ^ which are mapped 
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onto X Then X c X. If a e X then X is the ensemble of all aX where 
a e X, that is, X = XX From this it follows that X is a subgroup of ( S since 
if a«! and bn 2 (a, b e X and n 1 ,n 2 e X) are two arbitrary elements of X then 
an l (bn 2 )~ 1 = an 1 n 2 1 b ~ 1 = an 3 b _1 (wheren 3 = n 3 n 2 *e/). SinceXb -1 = 
b~ *X it follows that an 3 b" 1 = at” ‘n 4 e X because ab“ 1 e X since X is a 
subgroup. X is therefore homomorphically mapped upon X The set of 
elements which are mapped onto e is X because X is a subset of X There¬ 
fore X £ X/X = XX/X From the homomorphism of X onto X it 
follows that X £ X/X n X since XnXis the set of elements of X which 
are mapped onto e. 

Therefore we obtain 


XX/X s X/X n X (4.1) 

Let ^ be mapped homomorphically onto § and 'S be homomorphically 
mapped onto ^/X where X is an invariant subgroup of In this way 'f# is 
homomorphically mapped upon ^/X and we therefore find that aF/X = 
0/X where X is the set of elements which are mapped upon the unit element 
of (f/X— -precisely the same elements which are mapped by the first map of 
^ onto to elements of X. If X is the invariant subgroup for which the map 
of onto % to the unit element of ( S, then we find that (F £ X'X For this 
homomorphism of onto (F we find that X (which contains X) is mapped 
onto X, and we therefore obtain X = X/X Thus, from 5?/X £ <F/X it 
follows that 

af/X £ (SF/X)/(X/X). (4.2) 


5 Direct Products 

A group is said to be a direct product of two subsets Jf x and X 2 

at = Xj x jt 2 

if and only if: 

(1) ^ = X!X 2 . 

(2) Xj and Jf 2 are invariant subgroups in < S. 

(3) ^n/ 2 = {e}. 

This is equivalent to the following: 

(a) Each of the elements g of can be written in the form g = n y n 2 where 
n x e,yf), n 2 e.A r 2 . 

(b) The elements of commute with all the elements of Jf 2 . 

(c) n t , n 2 are uniquely determined by the relation g = n 1 n 2 . 

Proof. (1), (2), (3) imply (a), (b), (c). 

(a) is an immediate consequence of (1). In addition n^nf 1 e/ 2 because X 
is an invariant subgroup. Therefore n 1 n 2 nf l n 2 1 sjV~ 2 . n 2 tii l n 2 l is, however, an 
element of X, and the same is true for n 1 n 2 n^ 1 n 2 '■ According to (3) it follows that 
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n^n^n^ 1 = e, from which it follows that (b) holds. If n,n 2 = n\n' 2 then from 
(n\)~ 1 n 1 = n' 2 n 2 1 is an element of both and jF 2 and is therefore equal to e, from 
which we have proven (c). 

(a), (b), (c) imply (1), (2), (3): 

(1) follows directly from (a). Since n 2 commutes with the elements of it follows 
that gjV[g~ l = n l n 2 J r 1 n 2 1 ni l = n 1 .A / ' 1 n l ~ 1 = from which we have proven (2). 
If m is an element of both .A'l and , then it can be represented in two ways in the 
form n l n 2 : m = me = em. From uniqueness (c) it follows that m — e. 

The generalization of this result to several factors 
3 = x jV 2 x ••• x JF r 

is immediate: g = ■ ■■n r where the n v all commute and are unique. 

For several independently given groups 3 U 3 2 , • • • > we may define a 
new group with elements g = g t g 2 ■■■ g r where gg' = (grf'Jig 2 g , 2 ) ■ • ■ (g r g' r )- 
It is easy to show that 3 = 3 2 x 3 2 x • • • x 3 r . 

\i3 = jV x x jV 2 x • • • x ^and d p = x Jf 2 ■ • • x jF p -i x jV p+1 x 
• • • x Jf r , then f x and J3 p n d p = {e}. Therefore, according to 
the isomorphism theorem, 3/d^, = d p and 3/d p = ./V f) . 


6 Representations of Groups 

Let ^ be a group and let a map Q of 3 into the set of linear operators for a 
vector space 9C be defined such that, for a e 3 and a -* Q (a) the following 
equation 

Q (a)Q(h) = Q(ab) 

is satisfied, then it follows that the vector space 9C can be written as a direct 
sum 93 = i + i 0 where i = Q(e):F and t 0 = (1 — Q(e));£. For uet and for 
v 6 i 0 it follows Q(e)u = u and Q(a)v = 0 for all a e G. If 93 is a Hilbert space 
and Q(a) is a unitary operator it directly follows that Q(e) = 1. If Q(e) = 1 
then it follows that Q(a _1 ) = Q(a) 1 . 

For a fixed representation, for aef we shall write au instead of Q(a)nby 
means of which ^ is a set of linear operators of 93. We then say that 3 is an 
operator domain of 93 (see AIV, §14). 

If 93 is a finite-dimensional Hilbert space and if -3 is a unitary representa¬ 
tion (that is, all the Q(a) are unitary) then since Q(a ~ *) = Q(a) + , the operator 
domain 3 is completely reducible (see AIV, §14). If 93 is finite dimensional 
(but not necessarily a Hilbert space) and if 3 is a finite group, then the 
operator domain 3 is completely reducible. This can be shown by introducing 
an inner product in such a way that the operators in 3 are unitary. If u u ...,u„ 
is a basis for 93, then we define a function F(u, v ) as follows: 

n n n 

F(u,v)= £ s vJ3 v for u=Y “» a * and v = X u *Pv 
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Let the elements of the group (and therefore the operators of 9C') be denoted 
by a, b,c,.... Ya or Yb shall denote that we are to sum over all finitely many 
group elements. We define the inner product as follows: 

(u, v} = Y F(au, av ). 

a 

If b is a different element, then, for c = ab we obtain 

(Jju, bv ) = Y F(abu, abv ) = Y F(cu, cv ) = (u, v") 

a c 

because, as we run through all a, c = ab (for fixed b !) runs through all the 
elements of the group exactly once because the equation c — ab has an 
inverse a = cfr -1 . Thus all group elements b are unitary with respect to the 
above defined inner product. 

In §10.6 we shall find that each unitary representation of a finite group or 
a compact group is completely reducible in an infinite-dimensional Hilbert 
space, and we may apply the theorems of AIV, §14 to (instead of M). 

7 The Irreducible Representations of a Finite Group 

In order to find all possible irreducible representations of a finite group ^ we 
shall consider the group algebra or group ring 0t 9 . Its elements are defined 
by the set of all (formal) sums Ya Pa a where the fi a are arbitrary complex 
numbers. The sum of two ring elements is defined by (£„ p a a ) + (£ a y a a) = 
Ya (Pa + fja; the product of two elements is defined by (Ya Pa «) (Yb 7^) = 
( Ya.b PaTbab). It is easy to verify that the usual rules of addition and multi¬ 
plication for ring elements are satisfied. The unit elements e of ^ is also the 
unit elements of the ring. 

With respect to addition St 9 is therefore an h-dimensional vector space 
(i h = order of the group ( S') with the operator domain where the product 
of a with a ring element t = Yb Pbb is defined as follows: 

at = Y Pbab. (7.1) 

b 

This representation of ^ in is called the “regular” representation. It is, 
according to §6, completely reducible. 

An invariant subspace is a subspace of 0l 9 which has the following property :■ 
if the subspace contains t, then it contains all at (a e r S), and therefore contains 
all (Ya Pa a )t = st where s = fi a a. 

Therefore the invariant subspaces are those subsets / of which contain 
the difference s — t for s, t e / and the product st for t e / and arbitrary s in 
. Such subsets of a ring are called left ideals; if a subset contains ts instead, 
it is called a right ideal. If both, then it is a two-sided (dual) ideal. St 9 is 
therefore a direct sum of irreducible left ideals, 

= Al + ^12 + • • • + £\a + ■ • ■ + ^qp (7.2) 

where the with the same first index are isomorphic. 
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We now have the following important theorem: 

Every irreducible representation of is equivalent to a representation of 
f# in one of the /’s. In other terms, every irreducible vector space (%)3k where 
( S is the operator domain is isomorphic to one of the /’s. 

We note that every representation of can be extended to a representation 
of 0t$ defined in 3k where: (X,J? 0 a)w = P a (au). The decomposition of a 
vector space 3k into irreducible components is not changed as a result of this 
extension of the operator domain. 

Let (3k,g)3k be an irreducible representation space of 0t 9 . Let v be an 
arbitrary element of 01. We shall now consider the following map of 3k% onto 
for a subset of 3k \ s -* sv (where se3k<$). This map is a homomorphism 
(here 3k$ is considered to be a vector space with the operator domain 3k r $) 
because, for te3k<$\ 

ts —<• (ts)v = t(sv). 

Thus 01 must be isomorphic to a direct sum of some of the / (see AIV, §14). 
Since 01 should be irreducible, it follows that 0t is isomorphic to one of the k Vfl . 

If, in the decomposition (7.2) we collect the classes of isomorphic by 
one subspace / v , the decomposition of 3k% can be written in the form 


3ky — + ^2 T ' ■' T kq (7.3) 

with uniquely determined (see AIV, §14). 

We shall now consider the homomorphic maps of 3k 9 onto itself. Let T 
denote one of these. Suppose that the unit element is mapped upon t: 

Te = t. (7.4) 

Since T is a homomorphism, it commutes with the elements s of 3k <# as 
operators; it follows that: 


Ts = Tse = sTe = st. (7.5) 

Each homomorphism T therefore corresponds to a ring element t for which 
Ts = st. Conversely, the map s -> st is, for arbitrary t, a homomorphism since. 

rs —<• ( rs)t = r(st). 

If we carry out the two homomorphic maps T u T 2 successively, it follows that 
1\T 2 s = st 2 t u that is, 7) T 2 corresponds to t 2 t 1 . The map 1\ + T 2 clearly 
corresponds to t x + t 2 . 

The homomorphic maps of a complete reducible vector space are, from 
AIV, §14, already known to us. The ring 3k is therefore “oppositely” iso¬ 
morphic to the ring of homomorphic maps of 3k v onto itself, “opposite” in 
the sense that 7) T 2 corresponds to the product t 2 t v 
Let E$} denote one of the isomorphic maps of onto (where 
BJ/j’s = 0 for all s e k with v # 1 or \i # a), then these maps correspond to 
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elements e$ of 3k Now, according to AIV, §14, T = Z*/jv r$£$ an ^ there¬ 
fore every t can be written in the form t = Za/sv From E$E§} = £$ 

it follows that = e$. Similarly, in general we obtain: 

e$e% } = K^-AV- (7.6) 

In the e$ we have found the decomposition of St 9 into irreducible left ideals: 
e ( xl, e ( 2 i, 63 V,... form the basis (the e { £ are linear independent because the 
E§1 are) of an irreducible left ideal , because 

te$ = Z = I (7.7) 

a/?v a 

Therefore t will be represented in this left ideal by this basis in terms of the 
matrix The dimension n x of i x is therefore equal to the number of the 
/ 1(Z , that is, the number of left ideals isomorphic to ( xx . The left ideals k lp = 

(e^J, e^J,...) are, however, isomorphic to / n . We obtain / 11 + t 12 -1-- 

which therefore has the basis e$. The i x are therefore rings of dimension n x . 
In particular, we obtain e = Zp.v e pp- 
The number of different irreducible representations of a group is therefore 
equal to the number of the t x . If the representation generated by the ring t x 
is of degree n v then the dimension of i x is equal to n x , and the dimension of 
the ring 3k<g is equal to Zv n l- Since the dimension of 3k<$ is also equal to the 
order h of the group, we obtain the theorem of Burnside: h = Zv n l- If we 
consider a special element a of the group and its decomposition 

Z 

pav 

then the g ( p v J form, as a function of a, that is, as a function on the group 
manifold, a complete linear independent system. It is only necessary to 
demonstrate the linear independence because the completeness follows from 
the fact that there are exactly as many functions as group elements. Suppose 
that Zp-tv tyWpXa) = 0 for all a. Then since t = Za= Zp<™ il 
follows that r ( p v > = Z« PadpXa) and we therefore obtain Zpav = 0 for 

all possible values of t p< *. This is only possible if all the X ( p l are equal to 0. 

The center 3£ of a ring is the set of all elements of the ring which commute 
with all the elements of the ring. is itself a ring. Let z = Z a Pa a belong to the 

center 2£ of the ring 3k%. Then for all elements of the ring t Za Pa a = Z« P« at - 
This condition is satisfied if it is satisfied for all the elements of the group, that 
is, h Zo P a a = Z« Pa a b and therefore Za p a bab~ 1 = Za Aa This is the case 
if and only if fi a = [l bab -1 for all b. An element c from ( S is said to be conjugate 
to d if there exists an element b such that c = bdb -1 , from which it follows 
that d is also conjugate to c. If c is conjugate to d and if d is conjugate to / 
then c is also conjugate to f The mutually conjugate elements form separate 
equivalence classes. Thus, for an element z of the center of 3k 9 the p a must be 
identical for the elements of the same class. If k = Za a > where the sum is taken 
over all the a in a given class, we can therefore write z = Z* A K where this 
sum is to be taken over all classes. For arbitrary y k this sum is also an element 
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of the center. The dimension of the center is therefore equal to the number of 
classes in <g. 

On the other hand, z can be expanded as follows: z = e p J. In 

order that z commutes with all the elements of 0t 9 it suffices to show that z 
commutes with all the e$. Here we directly see that z must have the form 

z = X <f v) e (v> where e (v) = X e^. (7.8) 

V ' p 

Therefore the number of irreducible representations is equal to the number 
of classes of conjugate elements. 

8 Orthogonality Relations for the Elements of 
Irreducible Representation Matrices 

In the previous section we have seen that the g ( p l{a) form a complete linearly 
independent system of functions on the group. In addition, they satisfy 
certain orthogonality relations. To simplify notation we shall write G (v> (a) = 
(gpX a ))- If G <v> (a) is the matrix of the vth irreducible representation in a vector 
space i (v> (for example, if vl in the previous section) and G (tl \a) is the matrix for 
the /rth representation in then the matrix product 

T = X« G M (a)CG^\a -') 

with an arbitrary matrix C of n (v) rows and n iil) columns is a homomorphism 
of onto t (v) since 

bT = G (v> (b) £ G fv \a)CG M (a~ ‘) 

a 

= X G (v) (ba)CG ( ^(a ' x ) 

a 

= Y, G (v) (c)CG <m) (c“ 1 b) = X G ,v, (c)CG (M (c- l )G (ti> (b) 

c c 

= Tb. 

Thus (see AIV, §14) T must be the null operator for v # g; for v = g, T must 
be a multiple of the unit matrix E, that is, T = 8 vl Ji (v) E. Therefore we obtain 

X g ( K v l(a)c, p g<;Xa- l ) = <5 V „ p M 8 Kt . (8.1) 

a, A, p 

Since the c kp are completely arbitrary, it follows that 

X gZKakVKa- 1 ) = (8-2) 

a 

For v = g we set k = x and sum over r. Since 

X^v^v 1 ) = ^ 

T 

it therefore follows that 

X^a = /S 3 I^- that is, hd pX = wy\ 
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where h is the order of the group and n (v) is the degree of the representation. 
Thus we obtain 

\ I dttiaWKa -') = A* 4 (8 3) 

n a n 

In this sense the functions gf$(a) are mutually orthogonal. If the representa¬ 
tions are unitary, then we obtain 

= yja) 

and we therefore obtain 

\ I gVMKa) = AAA 4- (8-4) 

n a ft 

If 9C is an arbitrary representation space, if v is a vector from then for fixed 
A and v the u‘ v) = (n (v) /h) £ a g [ ~}(a ~ 1 )av span an invariant subspace i of SC 
since 

bn?' = \ Z 0&V "^bav 

ft a 

= ~ l b)cv 

= ~Y'L9 i ti(c~ 1 )cvgpKb) = z UpVpXb)- 

ft cp p 

i is therefore either null or the representation in t, is equivalent to G (v \a). 

If SC is finite dimensional, then the representation in SC can be reduced with 
the aid of the gr^a) as follows: Beginning with a basis {v v } of SC, construct, 
for each r v , in the manner described above, the subspace v Then SC = A K, 
where we eliminate null spaces and only include multiple occurring spaces t v 
once in the sum (since the are irreducible, we either have t v n = {0} or 

Ci = i v ). 

If we only wish to know the number of times a certain irreducible represen¬ 
tation occurs, then the character of the representation is often the most 
suitable way. The character of a representation is a function defined on the 
group as follows 

X(a ) = tr( G(a)), (8.5) 

where G(a) is the matrix for the representation. Since the trace is independent 
of the choice of basis, % is, the same function for all isomorphic representation 
spaces. 

Since x( a ) = x(bab ~ 1 ), / is a function of the class of conjugate elements. 
Let x <v> ( a ) denote the characters of the irreducible representations. We shall 
now show that the characters form a complete system of class functions. 
Since there are exactly as many characters as there are classes, it is only 
necessary to prove their linear independence. Since £ vp<r A^gr^a) = 0 is 
possible only for the case AJ)j = 0. It follows if we set Aj, v j = A (v) S pa , that 
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Zvp<t ^pa9pX a ) = A ( *Y v, (a) = 0 can only be satisfied if all the A (v) = 0. 

The character x can be uniquely expressed in terms of the y (v) as follows: 

X( a ) = I m (v) x (v) (a) (8.6) 

V 

It is easy to show that /(a) contains a particular y <v) (a) as often as the vth 
irreducible representation occurs, that is, m {v) times. The m (v) are therefore 
integers. 

If in (8.4) we set k = A and p = r, and we sum over k and p, then we would 
obtain 


r I y. (v} (a)x ( ^(a l ) = <5„ v 

^ a 

and, since, for finite groups, all representations are unitarily equivalent, we 
may choose the representations to be unitary, and obtain 

kx ,v) (« = y. (8.7) 

ft a 

Thus, from (8.7) and by multiplication with x (fl \ a ) and summing over a we 
obtain 


m (v) = ! I X (v) (a)x(a). 

” a 


( 8 . 8 ) 


9 Representations of the Symmetric Group 

The group of /! permutations of / things is called the symmetric group S. 
If is a left ideal of the group ring 3k % then it is possible to decompose 3k s 
as follows: 3k % = + 4- For the unit element e we therefore obtain e = 

e x + e 2 . Multiplying on the left by e x we obtain e l = e\ + e x e 2 and therefore 
e? = e 1 and e x e 2 = 0. e x is called an idempotent element. 3t s e x is identical 
to since 3k s e x is a left ideal and must be contained in k x ; for all elements 
a of/ x it follows from e = e x + e 2 , on the other hand, that a = ae x + ae 2 = 
ae x . In order to find the irreducible representation of S, we shall only seek 
those idempotent e v for which the left ideal defined by 'M s e v is irreducible. 

We shall now consider the permutations p to be operators which permute 
the order of / things a x , a 2 ,..., a f : 

PdlU 2 Clf U ai Cl X2 • ■ ■ Cly. 

For 


qa x a 2 • ■ • aj- — ap 1 ap 2 ■ • ■ cip f 


we obtain 


pqa x a 2 ■ ■ ■ a f — cip^ap^ • • • 
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Figure 50 


We construct a tableau with k rows with n l5 n 2 ,. ..,n k places 
(rii > n 2 > n 3 > ■ ■ ■ ^ n k , n v = /), as is shown in Figure 50 for the case 
in which / = 12, k = 4, n 1 = 4, n 2 = 3, n 3 = 3, n 4 = 2. We shall represent 
such a tableau by T„ where n is an abbreviation for n u n 2 ,... ,n k . Let m x 
denote the number of places in the first column, m 2 for the second, etc. Then 
m 3 > m 2 > ■ ■ ■ > m k , ^ = / and m t = k. 

If we arrange the / things a f into the places in the tableau, we then obtain 
a scheme. A particular sequence a x ■ ■ ■ a n of the a, may be called the initial 
sequence. If we arrange the a ; into the tableau T„ in its initial sequence 
beginning with the first row, then the second, etc., we obtain a special scheme 
which we shall denote by S n . We shall denote the result of applying a permu¬ 
tation p to the a, in the scheme S„ by pS„. For a given tableau T„ there are 
/! different schemes pS n . 

For each tableau the permutations r which only permute the elements of 
each row of S n in itself are uniquely determined; the same is also true for the 
permutations s which permute only the elements in the same column. There 
are therefore ]~[ v (n v )! permutations r and ( m n) • permutations of the 
form s. 

The (— l) p are equal to +1 for an even and — 1 for an odd permutation p. 
(A permutation p is said to be even or odd depending on whether the permu¬ 
tation of the u v results in the expression [],<„ («, — u^) changing sign or 
not.) For the elements 

/„ = £> and K„ = X(-l) s s (9.1) 

r s 

of the group ring it follows that 

rl n = I n r = (-1 YsK n = K n {- l) s s = K n . (9.2) 

Thus it follows that In = riv (»v)! In and K 2 n = [] M (m M )! K n . Therefore /„ 
and K n will be, with suitably chosen factors, idempotent elements. /„ and 
K„ are therefore left ideals in 9k s - They are, in general, not irreducible. We 
will now show that for a suitable actor y n 

e„ = y n I n Kn (9-3) 
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is an idempotent element and the left ideals PA S e„ are irreducible, and, for 
different n (that is, for different tableaux) are inequivalent. For this purpose 
we shall now prove the following combinatorial lemma: If pS n and p'S n . are 
two schema for which n > ri (that is n 1 > n\ or = n\ and n 2 > n' 2) or ...) 
such that in pS n two of the a t never are found in the same row which occur in 
p'S n ■ in the same column, then n = ri and there exist two permutations r, s 
such that pS n = rsp'S n , that is, rsp' = p. 

Proof. Assume that n l > n\. Since the a { of the first row in pS n must be placed into 
different columns in p'S n , T„. must have at least n { columns. It follows that n x = n[. 
There exists a permutation si such that in the schema s[p'S^ all of the a i of the first 
row of pS„ must be in the first column of . Since all elements a, of the second row 
of pS n must be in different columns of s[p'S „., it follows that n' 2 > n 2 , from which, 
together with n > ri it follows that ri 2 = n 2 . By a suitable permutation s 2 it is possible 
that all the elements of the second row of pS„ will also be in the second row of si s\p'S n .. 
By a repetitive argument, it follows that n = ri and that there is a sequence of per¬ 
mutations s[. such that in 1 ■ ■ ■ s' 2 s', p'S„, the elements a t appear in the same 

rows as in pS„. Therefore there exists a permutation r such that pS = rsp'S where 
S = s mi • • • Sj . 

If p is an arbitrary permutation, then for n > ri there exists in S„ a pair of 
elements in the same row which occupies places of the same column in 
p~ 1 S’„-. If t is the permutation which exchanges these two a ; in S n and t' is 
the corresponding permutation in p~ 1 S n -, then t! = p~ 1 tp. Thus, from (9.2) 
and since t 2 = 1 it follows that 

I'nPKn- = hPP~ l tpt'K n . = l„tpl'K n . = -I n pK n , 

from which it follows that I n pK n . = 0. 

Since this is true for each p, it is true for each element of 01 s , that is, 

I n ^ s K n : = 0 for n > ri. 

For L„ = I n K„ ( L„ ^ 0 because £ r , s (— l) s rs contains only one term with 
the unit element and the factor 1, namely, the case in which r = s = e) it 
follows that rL„ = L„ = L„s( — l) s . Conversely, if 

ra = a = as{— l) s (9.4) 

is satisfied for a ring element a it follows that 

a = AL„. 

To prove this, we write a = J]pP a (P)- From ras( — 1 ) s = a it follows that 
^] p rpsa(p)( — l) s = £ p pa(p). Therefore, if we set p = e in the sum on the left 
and if we set p — rs in the sum on the right we obtain a (rs) = (— 1 ) s a(e), that is, 
all coefficients a(p) where p may be written as the product of two permuta¬ 
tions r and s, are equal to ( — l) s a(e). If we could show that all the remaining 
a(p) vanish, then we would obtain a = u(e)L n . If p cannot be written in the 
form rs, then all the rsS„ are different from S„. According to the combina¬ 
torial lemma, in pS„ there are two elements a { in the same row which occur 
in the same column in S„. If t is the permutation whiclLexchanges these two 
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elements in pS„, then p' l tp does the same in S„. Thus t is a permutation of 
type r and t' = p~ 1 tp is one of type s, so that from 

Z rqsa(q)(-lf = J]qrx(q), 

<1 Q 

where we set r = t and s = p~ 1 tp it follows that 

- £ tW~ X tp<x(q) = £ qct(q). 

4 4 

Thus for the special case in which q = p we obtain 

oc(p) = -a(p), that is, a(p) = 0 

which concludes our proof. Since the element a = I n bK„ has the property 
(9.4) for an arbitrary element b of 01 s , it therefore follows that 

KbK n = X b l n K n = X b L„ 

and, in particular, 

L 2 n = I n (K n I n )K n = P„L„. 

For each b we therefore obtain L„bL„ = p b L n ■ /„ = ,# s L„ is an irreducible 
left ideal otherwise there would be an irreducible left ideal £' c £ n , and for an 
element b' from (' it would follow that L n b' = pL n . Since for b' it follows that 
all a b' are in £', either we have L n £' = 0 or L n £' = (L„) where (L„) is the one¬ 
dimensional subspace of spanned by L„. From LJ' = (L„) it follows that 
/„ =S/t s L n = L„£' <=£„£'<= £', that is, £' = £„. From L n £' = 0 it follows 
that tf = 0. If we decompose e — e' + e according to the decomposition 
& s = £' + then, upon multiplication with e' it follows that e' 2 = e' and 
e e = 0 and finally t' = .^ s e'. Since /'/' = 0, we obtain e' 2 = 0, that is, 
e' = 0 and therefore £' = 0. 

We find that L 2 ^ 0 and therefore /i„ # 0, otherwise from L 2 = 0, and 
from £ n L n = 0 it follows that the trace of the map of onto itself defined 
by a -» aL„ would be zero, a result which is easily obtained from the choice 
of a basis for the decomposition = /„ + t. For the basis of the group 
elements it follows that the trace is f\L n (e) = /! where L n (p ) is given by 
L„ = £ P pL n (p)- From this result it follows that if g is the dimension of i n for 
fi n , we obtain 

Pn = 9~ 1 f'- 

since the trace is, according to the first method, equal to (i n g. g is computed 
in XIV (8.28). From p n # 0 it follows that e n = p~ 1 L„ is an idempotent 
element (that is, e 2 = e n ) of t n . It remains to show that two left ideals £ n . 
correspond to inequivalent representations. Suppose, for example, that 
n > n', then we obtain 

lj n , = I n @ s L n , = IM n ,K n , cz I n ® s K n , = 0. 
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/„ will therefore be represented in by 0. Suppose /„ = , then I n must also 

be represented in 4 by 0. We note, however, that 

l n L n = I„I„K n = n M! I n K n = n («»)! K # 0. 

V V 

We therefore have as many irreducible inequivalent representations as there 
are different tableaux, that is, as many as there are decompositions of / into 
a sum of integers / = n v . This number is identical to the number of classes 
of conjugate elements in S. In order to prove this we shall write the permuta¬ 
tions in “cycle representation” 

P = • •' z n k , (9.5) 

where, for example, the cycle z 3 = (2 4 1) yields the following permutation 
(2 4 I)fl 1 a 2 a 3 fl 4 a 5 = a 2 a A a 3 a 1 a 5 . 

The z„. in (9.5) contain only different numbers. Since each such p can be 
uniquely represented in the form (9.5), and the cycles z„. in (9.5) commute, 
the desired result is not difficult to prove. It is easy to see that 

qZ ni -' Z n k q~ l = Z' ni z' n 2 ---Z' nk , 

where the cycles z'„ t have the same “length” n ; as do the z„ { , only the numbers 
in the cycles are permuted by q. A class of conjugate elements will therefore 
consist of all permutations for which the cycles are of the same length n t 
where n t = f. 

10 Topological Groups 

Topological groups play an important role in physics. The physical meaning 
of topology has been briefly outlined in VI, §1. We shall only attempt to 
provide a few important fundamental aspects of topological groups without 
detailed proof. The most important theorems for physics are proved in [41]. 

10.1 The Species of Structure: Topological Group 

A group is called a topological group if a topology (All, §1) is defined upon 
'$ for which the map (a, b) ab defined on f x f -> is continuous and 
the map a -» a~ 1 is continuous. These conditions are equivalent to the single 
condition—the map (a, b) -* ab" 1 defined on f# x ^ ^ is continuous. 

It therefore follows that, for fixed a, the map a -> ax is a homeomorphism 
that is, a topological isomorphism—of G onto itself. The maps x -* xa 
and x -> x" 1 are also homeomorphisms. 

Thus it follows that if V is the neighborhood filter of the unit element, 
then a’V = {aV | V e "V') and T" a are neighborhood filters of a. The topology 
of a group is therefore determined by the neighborhood filter of the unit 
element. 
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Conversely, if a filter F is given in such that the following conditions are 
satisfied: 

(1) The unit element is contained in each element V of F; 

(2) for each U e F there exists a Ve F such that V Fez U; 

(3) U e F implies that U ~ 1 e F; 

(4) for each aeG and U e F it follows that aUo" 1 e F 

(see TG 1—TG 3 in VI, §1.1) then this filter defines a topology on <& for which 
^ is a topological group. For a neighborhood filter of an arbitrary element 
a we need only choose the set of all aV where Ve F 

A neighborhood V of the unit element for which F 1 = V is said to be 
symmetric. For arbitrary V, FuF 1 , V n F \ VV~ l are symmetric 
neighborhoods. Thus it easily follows that there exists a set of symmetric 
neighborhoods of unity which form a fundamental system of neighborhoods. 

We shall only consider separated topological groups. The topology is 
separating if and only if the intersection of all neighborhoods of the unit 
element is the unit element. 

The general theorems about subgroups and invariant subgroups in §§1-4 
can be extended as follows: If F is a subgroup of <&, then its closure if is also 
a subgroup. If F is an invariant subgroup, then so is F The center of ^ is a 
closed subgroup of < S. 

A subgroup F of <8 is open if it has at least one interior point, because if 
a e F is an inner point of F then so is b = (ba~ l )a. An open subgroup F is 
also closed: Let a e F and let V be a symmetric neighborhood of the unit 
element with V <= F. Then we therefore obtain a V n F # (f>. Therefore there 
exists abeaV n F Thus, since V is symmetric, a ebV <= bF <= that is, 
ae 

If V is a symmetric neighborhood of the unit element, then the set of all 
products ni-ia, where a v eV is a subgroup of <&. Since the unit element 
is an interior point of is both open and closed. Therefore is an 
open set. If ^ ^ then <§ is not connected (see All, §4). If <& is connected, 
then 0 is generated by any neighborhood V of the unit element, that is, 
every element a of ^ may be represented in the form a = n? = t a v with a v e V 

Let Jf be the connected component (All, §4) of the unit element. With 
ae.yV, a~ l jV must be a connected subset of <&. Since eeaT^JV it follows 
that a~ l .A r <= Jf, that is, JV~ l ,AV cz JV, which means that F is a subgroup. 
For be^ it follows that bjVb~ x is connected, and e e b.A r F 1 , that is, 
£>Ffc~ 1 c Jf and therefore b./V'b ~ 1 = F since x -> bxb ~ 1 is an automorph¬ 
ism of ( S. Therefore F is an invariant subgroup. 

For b e it follows that hF = Fh“ 1 is a connected set which contains b, 
that is, fcF cz F where F is the connected component of b. For b e F it 
follows that F’Fci; that is, F c: bF and we therefore find that F = bF 

Since F is a connected group, it is generated by all products of the form 
n-=Fv where a v e V n F where V is an arbitrary neighborhood of e. 
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10.2 Uniform Structures of Groups 

In a topological group ^ it is a simple matter to define a uniform structure. If 
for every neighborhood V of the unit element e we assign a vicinity 

{(x, y) | yx ~ 1 e V} 

we obtain a fundamental system of vicinities, and therefore obtain a uniform 
structure called the right uniform structure. Similarly, a left uniform structure 
may be defined by {(x, y) |x -1 y e V}. The maps x -> ax and x -> xa are 
isomorphisms for both uniform structures. The map x->x -1 is an iso¬ 
morphism between the left and right structures. In physical applications most 
groups are such that the left and right unform structures are identical. 

If the map x -> x ~ 1 transforms Cauchy filters of the right uniform structure 
into Cauchy filters of the right uniform structure, then ^ has a completion 
and the group structure can be extended to #. The right and the left uniform 
structures produce the same # (for a proof, see [12]). 

If ^ is complete with respect to the right uniform structure, then it is also 
complete with respect to the left uniform structure and vice versa (for proof, 
see [12]). 

If a group has a neighborhood V of e which is either right or left 
complete, then ^ is complete. In order to prove this statement, consider a 
Cauchy filter 3F in < S. Since 3F is a Cauchy filter, there exists an element 
Mef which is small of the order of {(x, y)|yx _1 e V}. For x, e M we 
therefore obtain M cz Vx 1 . Since Vx t is complete, the Cauchy filter has a 
limit point x 0 e Vx 1 , which also must be a limit point of 3F in < S. 

Since every compact set is complete, it follows that every locally compact 
group is complete. For a compact group the right- and left-handed uniform 
structures must, of course, be identical, namely that of the compact space. 

If there exists a denumerable neighborhood basis of the unit element, then 

is metrizable, a result which follows easily from All, §2. 

In applications we only find metrizable and separable groups. It is there¬ 
fore always possible to introduce a new uniform structure ph of physical 
imprecision such that is precompact. For this purpose we choose a metric 

fM 

d(x, y). We consider all maps ( S ^ [0, 1] where /^(x) = n~ 1 tg~ 1 d(xa v , a„) 
and f ( v 2 J(x) = n~ 1 tg~ 1 d(a v x, a lt ) where {aj is a countable dense subset in 
f#. The initial uniform structure ph corresponding to the is then pre¬ 
compact and the maps x ax, x -» xa are uniformly continuous. For the 
completion @ ph the group structure can be lost! 


10.3 Lie Groups 

We shall not present a detailed description of the theory of Lie groups. 
Readers who are interested in the full mathematical theory are referred to 
the specialized literature, in particular, to [13]. Instead, we shall develop 
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some of the fundamental concepts which are of considerable importance for 
quantum mechanics. This importance induced some authors to seek a 
formulation of quantum mechanics based entirely on purely group theoretical 
methods (possibly in a mystifying way). This section is for those readers who 
do not yet have an intensive familiarity with quantum mechanics. 

A Lie group ^ is a finite-dimensional topological group which has an 
open neighborhood U 0 of the unit element which can be mapped homeo- 
morphically onto an open set of R p in such a way that the group operations 
are continuously differentiable. This means that for a e U 0 there exists a 
correspondence a <-» a 1 ,..., a p (where for simplicity we assume that 
e«->0,0,...,0) such that for b <-> jS 1 ,..., j3 p , for c = ab and c <-» y 1 ,..., y p , 
the functions y v = /„(a 1 ,..., a p ; , fi p ) are continuously differentiable. 
The correspondence a <-> a 1 ,..., a p is called a chart in the neighborhood U 0 
of e. Other charts can be obtained by using continuously differentiable 
transformations. 

From U c — cU 0 we obtain a neighborhood of the element c. Each b e U c 
may then be written as follows: b — ca where aeU 0 . From a <-► a 1 ,..., a p 
and b = ca <-> a <-» a 1 ,..., a p we obtain a chart b <-> a 1 ,..., a p for the 
elements of U c . 

Let c e U d and deU c . U c n U d is then an open set. Let / eU c n U d . Since 
/ is an element of U c we may have / <-> y 1 ,..., y p in the chart of U c ; since / is 
an element of U d we may have/<-> d 1 ,... S p in the chart of U d . Thus we have 
<5 V = ■■■, y p ). We will now show that the g v are continuously differ¬ 

entiable. We have / = ca — db where a <-> y 1 ,..., y p and b S 1 ,..., 3 P . It 
follows that b = d~ l f = d~ 1 ca. Since c g U d there exists an x e U 0 such that 
c = dx. Therefore b = xa from which it follows that the g v are continuously 
differentiable. In this way ^ is a differentiable manifold. It is not difficult to 
show, using similar arguments that the map (a, b) -> ab is continuously 
differentiable. 

By left multiplication each a e G corresponds to a transformation 
A: Ab — ab. For a chart a , a p in the neighborhood of e we may 

define infinitesimal transformations / v as follows: 7 V = (3a/<5a v ) a=0 . For 
an a which is a neighbor of e we write a — e — Sa = I v da'. 

In this way the / v form a linearly independent basis in the cotangent space 
(as a real vector space) at e. The /„ are not group elements but are maps 
b -»■ I v b of the elements be <3 in the cotangent space at b defined as follows 

Yj I v da'b — ( da)b = ab — b = Sb. 

V 

If a(s) is a differentiable curve [0, 1] -+ where a(0) = e and a(l) = a, then 
it follows that a(s + ds)a(s)~ 1 - e = Z v I v 8of with the differential form 

= I q;(a(s))S[i p , 

where the p p are coordinates of a chart containing a(s). It follows that 

da(s) = X a(s)Sco\s). (10.3.1) 
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The differential equations (10.3.1) together with the initial conditions 
a(0) = e yields the solution a(l) = a. The infinitesimal transformations J v 
determine the entire group r S (as transformations of g into itself). 

If we choose a curve a(s ) for which Sa) v (s) = p v ds where p' is constant, 
(10.3.1) becomes 


with the solution 


a(s ) = exp 



and we therefore obtain 


a = a(l) = exp 



(10.3.2) 


Therefore there exists at least one neighborhood of e in which it is possible 
to introduce a chart a <-» {p v } such that (10.3.2) is satisfied. Then for the 
original chart of the a v :(dp7<bc v ) a=0 = S J. 

The p' 1 are called the canonical parameters of the Lie group. With respect 
to the canonical parameters the group operations are analytic functions of 
the parameters. In particular, in a Lie group it is always possible to choose 
charts such that the group operations can be differentiated more than once. 

Equation (10.3.2) gives us the map b -* ab if the maps b -> I v b are known 
for all beg. According to §10.1 each element of the subset of g which is 
connected to e can be expressed as a product of elements from an arbitrary 
neighborhood U of the element e. Therefore it suffices to know the map 
b -*I v b for the elements beU. Since the b eU are neighbors of the element e, 
the following question is relevant: What aspects of the structure of /„ deter¬ 
mine the subset of g which is connected with e? The theory of Lie algebras 
answers this question. Here we shall only derive the relevant structure, 
without proving the existence of Lie groups nor providing a general theory 
of the structure of Lie algebras (see, for example, [13]). This structure is of 
particular importance for quantum mechanics. 

For an element a — e + £ v I v da' which is a neighbor of e it follows that, 
for any beg, the element bab~ l is a neighbor of e: 


bab 1 = e + £ 

that is: 

M*& -1 = EW fe )- (10-3-3) 

For b = b 1 b 2 it follows that 


£ h<( b i b 2) = ki(MA >1 1 


= b lYj I P <T v( b 2) b l 1 
p 


£/ t o-;(hiK(h 2 ). 
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Thus (10.3.3) is a representation of in the vector space spanned by the / v . 
If, in (10.3.3) we choose for b an element which is a neighbor of e, 

b = e + £ b dp p , i 

P 

then from (10.3.3) it follows that, since b~ 1 = e — £ p I p djf 

Up, U = Vv - KI P = I IpC%, (10.3.4) 

p 

where 



The structure constants c(f p are therefore well defined by the group C S. From 
(10.3.4) it follows directly that c£ p = — c£ v . From the easily proven Jacobi 
identity 

Up, Up, IJ] + Up, Uy, Ipf] + Uy, Up, IpS] = 0 (10.3.5) 

it follows that, for the c£ v 

E ( c %< + + CUD = 0. (10.3.6) 

X 

Given a real vector space defined by the basis J v , and introducing a “product” 
of I p and J v as follows 

P 

for which [J p , J v ] = — [J v , J p ] and (10.3.5) holds, then this vector space is 
called a Lie algebra. Is there a Lie group which corresponds to each Lie 
algebra? This is indeed the case, a fact which we shall not prove here (see, for 
example, [13]). 

In closing this section we shall consider the following special case: Let ^ 
be a transformation group of a space into itself, that is, there exists a map 
0 x X -> X such that h(a u x) = h(a 2 , x ) for all x implies a x = a 2 . Instead 
of writing h(a, x ) we shall use the simpler notation ax. If X is a differentiable 
manifold and if in there is a chart for a neighborhood of e such that the 
map x X -> AT is continuously differentiable, then all of the above con¬ 
siderations about ^ x -> G are applicable to x X -> X. In particular, 
the infinitesimal transformations J v can be considered as transformations 

(e + Y, h da v jx = x + <5x, 

where 

<5x = £ 7 v x dot” 

V 

of x into the cotangent space at the position x. Thus, the transformation ax is, 
known from (10.3.1), (10.3.2) if the J v x are known. 
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Conversely, if the transformations / v of X into the cotangent space are 
such that they form a Lie algebra, then the transformations a of I into 
itself are defined by (10.3.2). It can be shown that these transformations form 
a Lie group (see [13]). 

10.4 Representations of Topological Groups 

By a representation of a topological group 9 we mean a homomorphic and 
continuous map of 9 into a topological group of linear transformations of a 
vector space (see, for example, the map in the group in VI, §1 or in the 
group of unitary operators of a Hilbert space). 

A homomorphism / of a topological group 9 into a topological group 9' 
is continuous if it is continuous at a point (for example, at the unit element). 
If / is continuous, then the invariant subgroup f~ 1 (e') is closed in 9 since e' 
is a closed set in 9'. If Jf is the connected component of e in ^ (see §10.1) then 
/ (JT) is also connected (All, §4) and is therefore a subgroup of the connected 
component JV' of e' in 9'. If / is a bijection and a homeomorphism, then / is 
called an isomorphism of the topological groups 9, 9\ 

Let U be a neighborhood of the unit element ee9 and let / be a continuous 
map U -4 9' which satisfies the following conditions: From a,b,ceU and 
ab = c imply / (a) f(b) = f ( c ). Then / is called a local homomorphism. If 
f(U ) is a neighborhood of e' in 9' and 1/ 4/(1/) is bijective and / is a 
homeomorphism of U onto /( U ), then / is called a local isomorphy of 9 and 
9'. 

The question remains whether a local homomorphism can be extended 
as a homomorphism over all of 9. Certainly this can be the case only if 9 is 
connected, since in §10.1 we have seen that only the component which is 
connected with e can be represented as a product of finitely many elements 
of U. 

If 9 is connected, then every element be9 can be written in the form 
* = n" =1 «v where a v e U. Thus, in order that / be a homomorphism, 
m = re. x f (a v ) must be satisfied. This condition can be satisfied only if 

n"=i a v = n?=i < and flv.aj.el/ imply that Y\Uif( a .) = 11™=! /«)• 
This is, however, not generally the case. 

We say that 9 is simply connected if 9 is linearly connected (see All, §4— 
every connected Lie group is also linearly connected) and every closed path 
can be continuously shrunk to a point. If 9 is simply connected, then the local 
homomorphic map / can be extended to all of 9. 

In order to prove this statement, we consider a path b(s) with 0 < s < 1 
and b( 0) = e, b( 1) = b. Since the set 0 < s < 1 is compact, to each neighbor¬ 
hood V of e there exists an e such that b(s') e b(s)V, that is, b(s)~ 1 b(s') e V for 
I s' - s| < e. 

If U is a neighborhood of e and U 4 9' is a local homomorphism, we 
choose a symmetric neighborhood V for which VV c= U. To V there exists 
an e of the type described above. Thus it follows that there exists an increasing 
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sequence of finitely many s v (where s 0 = 0, s„ = 1) such that b(s)~ 1 b(s') e V 
for s v _j < s, s' < s v+1 . For c v = b(s v+1 )~ 1 b(s v ) it follows that f] v c v = b. 
If we set /( b ) = \\ f (c v ) it follows that for every refinement of the partition 
{s v } we obtain the same value f(b). Since two partitions have a common 
refinement, it follows that /( b ) depends only on the path b(s). Similarly, we 
see that a continuous change of the path b(s) from e to b does not change the 
value /( b ). Since @ was assumed to be simply connected, it follows that the 
local homomorphism can be continued in one and only one way. 

The continuation / is a homomorphism if /(a) f(b) = /( ab ). In order to 
show this we consider a path C x of e to a and a path C 2 from e to b. Let C 2 
be defined by b(s) where b( 0) = e, b( 1) = b. Then there is a path aC 2 from a to 
ab given by ab(s). C x + aC 2 is then a path from e to a and then to b. f(ab ) 
can be considered to be a continuation of / on the path C 1 + aC 2 . By 
using a sufficiently fine partition of the path, from local homomorphisms it 
follows that / (ab) = /( a)f (b ). 

If / (U) is a neighborhood of e' in W and if is connected then / (JS) = f#', 
which follows directly from the fact that every element of may be repre¬ 
sented as a product of elements from f(U). 

If / is a local isomorphism, then for the extended /: f(4f) = The 
extended / must not be an isomorphism. If Jf is the closed invariant sub¬ 
group which is mapped upon e', then Jf must be a discrete subset of 
that is, around each neJ r there exists a neighborhood in ( § around n which 
contains only n as elements of Jf. Otherwise there would be an element n 1 of 
Jf such that every neighborhood of n 1 would contain an additional 
element of Jf and if U is a neighborhood of e then n 1 U would contain an 
element n 2 J n 1 (and U would contain the element n 2 l n 2 # e) which con¬ 
tradicts the fact that / is a local isomorphism. 

Jf must be a subset of the center of < S. Proof: Since Jf is an invariant 
subgroup, neJ r implies that ana _1 eJf. For fixed n, ana -1 is continuous 
with a. Since f is connected and Jf is discrete, ana" 1 must, for continuously 
varying a be constant and therefore equal to ene ~ 1 = n, that is, a commutes 
with n. Conversely, if Jf is a discrete subgroup of the center of < §, ftjJf is a 
topological group if we choose the sets XJjJf as neighborhoods of the unit 
element for ^jjf where U are the neighborhoods of the unit element in < S. 
Here it is clear that the canonical map r S -> ‘SjJf is a local isomorphism. 

In our consideration of the continuation of local homomorphisms we have 
assumed that f# is simply connected. If fj is (linear, but) multiply connected, 
then by the transition to the so-called covering group <§*, we may reduce our 
results for the continuation of local homomorphisms to the case of simply 
connected, since is simply connected. 

If C 2 is a path from e to b, then aC 2 is a path from a to ab. For as a 
path from e to a then C x + aC 2 is a path from e to a and then to ab. If C' 2 
is a path from e to b which can be continuously developed from the path 
C 2 and if C\ can be continuously developed from C 1; then C\ + aC 2 is a 
path from e to ab which can be continuously developed from C, + aC 2 . If 
instead of the paths from e to a we consider equivalence classes of paths 
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Cj defined from the equivalence relation C\ ~ C x , then C, + aC 2 uniquely 
defines a class which we shall denote by <€ x + a c t> 2 . Here CJ ~ C x means 
that CJ can be continuously developed from C x . 

The covering group 7 #* is defined as the set of all pairs (a, 7 ?) where a e % 
and ^ is a class of paths from e to a (since can be multiply connected, to a 
given a there can be several classes) and multiplication is defined by 
(a, <? 2 ) = ( ab, ( € x + a ( € 2 ). The unit element of is equal to (e, 0) 

where 0 is the class of closed paths C from e to e which can be continuously 
shrunk to a point. The element (a -1 , W) which is reciprocal to (a, <&) is 
obtained by constructing the path C'\a~ 1 (s) corresponding to the path 
C:a(s ) where a(0) = e, a( 1) = a. 

The topology in is introduced by means of a neighborhood filter of 
(e, 0): V = {(a, <€ )\aeU , and there exists a Cef with C <= U} where U 
runs through the neighborhoods of e in < S. In this way becomes a topo¬ 
logical group. The map (a, <&)-> a of to f# is a continuous homo¬ 
morphism. The invariant subgroup which is mapped onto e is Jf = {(e, <€)}. 
Jf is called the fundamental group of ( S. 

is simply connected because a continuous path from (e, 0 ) to (a, ( £) is 
characterized by a path C of e to a where C e ( €. 

If there exists in ^ a simply connected neighborhood of e (which is the 
case for all Lie groups) then the map (a, <if) -> a is a local isomorphism of f#* 
to e S. The fundamental group is then a subgroup of the center of <§*. 

If ^ -> W is a local homomorphic map then -»• ^ ->■ <$' is also a local 
homomorphism. The local homomorphism -> may be uniquely 
extended as a homomorphism -> '’S' . 

The most important example physically is the three-dimensional rotation 
group which was denoted by @ 9 in VII. The topological group can be 
characterized in terms of the following three-dimensional parameter space 
(by which becomes a Lie group): The parameters y. x , a 2 , a 3 correspond 
to a unit vector aja, a 2 /a, a 3 /a where a = (af + tx\ + a 3 ) 1/2 ; the unit vector 
corresponds to the axis of rotation and a corresponds to the rotation angle 
(in the right-handed sense). We require that 0 < a < n. The points a 1; a 2 , 
a 3 are therefore the points of a sphere of radius n (see Figure 51). Two rota¬ 
tions a 1; ot 2 > (with a = n) and — — a 2 , — a 3 are identical, that is, we 

must identify diagonally opposite points of the sphere (see Figure 51). 
Clearly is compact, but is not simply connected, because in Figure 52 
there is a closed path which cannot be shrunk to a point. The covering group 
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2% consists (for every a e 2$) of two elements (a, & x ), (a, f £ 2 ) where c € x and 
^2 are characterized by the two paths C x and C 2 in Figure 53. All other paths 
from e to a can be continuously transformed into either C x or C 2 . If, for 
example, we use two pairs of diametrically opposite points P x , P' and 
P 2 , P' 2 (see Figure 54) then we can move P 2 continuously toward P\ whereby 
P' 2 is moved towards P x . The path P' X P 2 can then be ignored and the remain¬ 
ing path can be detached from P' 2 = P x and then formed to C x . 2g is iso¬ 
morphic to 2%fjf where the fundamental group Jf consists of the two 
elements (e , 0) and (e/4 ) and ( € is characterized by the path given in Figure 
52. It is easy to show that ( e, ( €) 2 = (e, 0). The fundamental group is cyclic 
of order 2. 

Of particular importance are the above considerations as applied to Lie 
groups. If in a finite-dimensional vector space a representation of a Lie 
algebra is given in terms of linear transformations which satisfies the commu¬ 
tation relation (10.3.4), then we obtain a local homomorphism of the Lie 
group in the form (10.3.2). A representation of the Lie algebra in terms of 
self-adjoint (not necessarily bounded!) operators in a Hilbert space does 
not automatically lead to a local homomorphism of the Lie group in terms 
of unitary operators. Additional conditions for the definition domain must 
be satisfied (see, for example, [5]). 


10.5 Group Rings of Compact Lie Groups 


Much of the structure of the representation of finite groups can be extended 
by analogy to compact groups ( S. This is a result of the fact that it is possible 
to define an invariant measure on the Borel subsets of and that the measure 
of is finite. Here it is possible to replace sums over finite number of elements 
of a group by integrals over ( S. 

A left-invariant measure m is defined by the requirement that for every 
Borel set 3T the following relation m(a,T) = m(,T) holds. In order to find 
such measures it is sufficient to determine m for the Borel subsets of a neigh¬ 
borhood U of the unit element, then, for every open set V cz U the relation 
m(aV) = m(V) must be satisfied and the aV cover all of G. A measure m 
for a Lie group can be explicitly given. 

We shall use 

f = / v (a 1 ,...,a'’; ) S 1 ,...,/?) 


which was given in §10.3 for a chart for the vicinity of the unit element e. A 
tangential vector d a at e will be transformed into the vector d p at a by the 
equation 



Using the functional determinant (Jacobian) 


A(a x ,..., a p ) 



(10.5.1) 
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permits us to define the following measure m 


m{,T) = c 


J 


4 da 1 ■ • • da p — 
A 



(10.5.2) 


which satisfies m(a&~) = m(.T). If the group f# is compact, then it is possible 
to cover with a finite number of sets a U where U is an arbitrary neighbor¬ 
hood of e. Thus m(A) is finite. We choose the constant c such that m(Xi) = 1. 

If is compact, then the above defined measure is also right invariant. 
If, for a neighborhood U of e the relation m(aUa~ l ) = m(U ) holds then m 
is also right invariant because it follows that m(Ua) = m(a~ 1 Ua) = m(U) 
since m is left invariant. To prove m(aUa ' ! ) = m(U) it suffices to prove that 
the determinant A = | cr(f(a) | = 1 where the a p are defined in (10.3.3). Accord¬ 
ing to §10.3 the (Jy(a) form a representation of the group. The sequence of 
elements a” (n = 0,1, 2,...) must, in a compact 0 have an accumulation 
point b. Thus there exists a subsequence a v ‘ which converges towards b. 
Thus it follows that A v ‘ -> B = \(T p (b)\. Since b 1 is represented by the 
reciprocal matrix, B # 0 and therefore we find that A = 1. 

In addition, for m we obtain 

m(3T) = miAT- 1 ) (10.5.3) 

where ST ~ 1 is the set of reciprocal elements of the set -T. It suffices to show 
that this is true for the case in which F = aU where U is a neighborhood of e. 
It follows that = U Therefore it is sufficient to show that m(U) = 
m(U~ x ) for a small neighborhood of e. An element of U then has the param¬ 
eter da v ; its reciprocal has the parameter — da', from which it follows that 
m(U) = m(U ~ J ). 

For an integrable function / over ^ we therefore obtain 

f f(a)dm(a)= f f(ab)dm(a)= f f(ba)dm(a). (10.5.4) 

J& 


If we reexamine the considerations of §8, it is easy to show that all theorems 
are applicable providing that we replace (1/h) • • • by j • • • dm(a). For the 

matrix elements gty(a) of irreducible unitary finite-dimensional representa¬ 
tions (where different v index different representations) the following 
orthogonality relations hold: 

dm(a) = 6^b tK 6 pk (10.5.5) 

and the following equation for the characters hold: 

J X^X^X^Xa) dm(a) = <5„ v . (10.5.6) 

We have not yet, of course, shown that there are sufficiently many irreducible 
representations such that the g ( T v J(a) form a complete system of functions over 
f# and that the x <v> (a) form a complete system of functions over the classes 
ofS?. 
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That this, is indeed the case we shall show with the help of group rings for 
compact groups. The measure dm introduced above defines a Hilbert space 
,£ 2 {£, dm) (see the end of AIV, §1). We form an algebra by introducing the 
following definition of a product: For/, g e J£ 2 (£, dm) we define 

(/• = \f(ab ' l )g(b) dm(b) (10.5.7) 


by analogy with the definition for finite groups (§7): 


z 



Z f(a')g(b)a'b 

a', b 


= Z a Z f( ab v M b )’ 

a b 


where we set a = a'b. It can be easily proven with the aid of the Schwartz 
inequality (AIV, §1) that / • g e f£ 2 {£, dm). 

We now use the matrix elements g£J(a) for the different inequivalent 
finite-dimensional unitary representations of the group £ which are elements 
of £‘ 2 (£, dm). If the g^l(a) are continuous, then, since m(£) = 1 we obtain 
g^Ka) e ,£ 2 (£, dm). There exists at least one such representation, namely 
the (one-dimensional) identity representation. According to (10.5.5) the 41 
form an orthogonal system in ,£ 2 (£, dm). The fact that this orthogonal 
system is complete will be proven below. First we shall derive a few relation¬ 
ships which are analogous to those derived in §7. 

We define 


41(4 = n (v Vti(a ')• (10.5.8) 

Thus, from (10.5.5) we obtain 

(41 • 4p)(4 = n (v) n Ul) jg£(ba~ y )g^X b ~ ‘) dm(b) 

= n (v) n w J £ 41 ( 4414 " 441(4 dm(b) 

= 4p4 t 4 V p(4, 


that is, 


41' = <444 




which is identical to the relation (7.6). 


(10.5.9) 
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The e \ c v ] (v and A fixed) span a finite-dimensional subspace 4 V) of dm) 

which is a left ideal because 


(/ • 41)(a) = J f(ab - ^(b) dm(b) 


= « (v) |/(c)3S(a“ 4') dm(c) 


= n (v) 



= EbV^lfa)- where = f f(c)g ( p K (c) dm(c). 

P J 


Similarly it can be shown that all the 41 (only v is fixed) span a dual ideal. 

In the ring j£? :2 (^, dm) the elements of ^ are not present. We may, however, 
define the elements of £ as operators in the Hilbert space i? 2 (^, dm) as 
follows: 

(afyb) = f(a~ i b). (10.5.10) 


It is easy to show that (10.5.10) defines a unitary (infinite-dimensional) 
representation of £ in -£ 2 (£S, dm). It follows that 

(ae ( £)(b) = 41( a~ l b) = n fv) g^l(b~ 1 a) 

= E 41(b)^4(a)- 

a 

Thus 4 V) is an invariant subspace and contains the vth irreducible 
representation of ( S. The 41 form an orthogonal system where 

| ^Ka)e^(a) dm(a) = <5 V „ S pK 6 ka n w (10.5.11) 

a result which follows directly from (10.5.5) and (10.5.8). If the 41 f° rm a 
complete orthogonal system, i? 2 (^, dm) decomposes in the form 

se : X<§, dm) = E © 4 V) = E © ^ <v) > (10.5.12) 

v, A v 

where / (v) = Ea© 4 v> - The £ v) are finite-dimensional and two-sided ideals. 

If the 41 are not complete, then there exists a vector r / 0 which is ortho¬ 
gonal to all 41- For r we define 

K(d, b) = J r(b ~ 1 dc)r(c) dm(c) 
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and we define the operator K: 

(Kf)(a) = J K(a, b) f(b) dm(b). 

We may therefore also write Kf = f -r-f where r(a ) = r(a~ v ). First we 
show that K is a self-adjoint and compact operator (see AIV, §§4 and 9). The 
fact that K is self-adjoint follows from 

K(b, d) = J r(d ' 1 i>c)r(c) dm(c ) 

= Jr(c')r(h ~ 1 dc r ) dm(c’) = K{d, b). 


K is compact, since from 

I K(d, b)\ < j\r(c)\ 2 dm(c) 
it follows that the following relation 

J|K(d, b) | 2 dm(d) dm(b) < oo 

holds. 

Thus K has only discrete eigenvalues, of which those which are nonzero 
can only be finitely degenerate. Since r # 0, from 

</, Kf; = J f(d)r(b ' 1 dc)r(c)f(b) dm(d ) dm(b) dm(c) 


= J/ (d)r(b l c l )r(d l c i )f(b) dm(d) dm(b) dm(c) 
= J J r(b~^c')f (b) dm(b) dm(c') 


it follows that K is not the null operator, that is K must have a nonzero 
eigenvalue. 

From 


(aKf )(d) = | K(a ~ 1 d, b) f(b) dm(b ) 

= | K(d, ab)f(b) dm(b) 

= J iC(d, fo')/(a~ x b') dm(fo') 
= (Kaf)(d) 
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it follows that K commutes with all elements of ( S. A finite-dimensional 
eigenspace <s of K with a nonzero eigenvalue A is therefore an invariant 
subspace, that is, a representation space of C S. This may be reduced so that 
in o there exists an irreducible representation, which we call the vth represen¬ 
tation. Let f p e a be the basis vectors which corresponds to this vth representa¬ 
tion. We obtain Kf p = Xf p . 

We will now show that f p e / <v) where / v) is defined above. For e (v) = 
Z T it follows from/ e / <v) that e iv) ■ f = f • e <v) = /. 

Since / v) is a two-sided ideal, it follows that for arbitrary g : e (v> ■ g e / (v) 
and g ■ e (v) e A <v) . If for a g the relation e (v) • g = g it therefore follows that 
g e / v) and 3 • e M = g. 

With / p e d defined above it follows that 

(e (v) • / P )(a) = | e ly) (b) fp(b ' i a) dm(b ) 

= J e (v) (b)bfp(a) dm(b ) 

= jl e^{b)f a (a)9 { :l(b) dm(b) 

= « (v) Z /» f Z 9 ^)C(b) dm(b) 

a J t 

= fM 

Therefore it follows that/, e / (v) and also/, • e (v} = f p . Since Kf p = f p r r = 
Xf p we obtain 

/’■ gW = l/ , ' r ' f ' gW - 

Since r is orthogonal to / <v) it follows that 



since e (v \ba) is, for fixed a, an element of the two-sided ideal / <v) . Therefore 
we should obtain/, = f p ■ e (v) = 0. Thus it follows that r = 0. 

For an example of a volume measure on a compact group we shall consider 
U„ the unitary group in an n-dimensional space t. Each unitary transforma¬ 
tion A can be represented in terms of a basis in 1 as follows: 

A = UEU + , (10.5.13) 

where £ is a diagonal matrix (with the diagonal elements e v = e‘ Xv ) and U is 
a transformation which transforms the chosen basis of t (in which E is 
diagonal) into the basis of the eigenvectors of A. For A we will choose the 
first n parameters to be the angles a v . These parameters have the advantage 
that two different transformations with the same parameters a v belong to 
the same class. 
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How many additional parameters do we need in order to determine A ? 
U is, for fixed E, not uniquely determined. Since U transforms the given 
basis vectors into the eigenvectors of A, that is, into another basis, U is 
determined by the basis vectors which are determined up to a factor e ld . 
For the first basis vector we therefore find that 2(n — 1) real parameters are 
necessary. Since the second is orthogonal to the first, only 2(n — 2) real 
parameters have to be chosen. For all n vectors we need n(n — 1) = n 2 — n 
parameters. The group U„ is therefore an n 2 parameter group. 

From (10.5.13) it follows that 

dA = dUEU + + U dEU + - UEU + dUU + . 

In order to find the volume, we must transport a volume element spanned by 
vectors dA near A by the transformation A ~ 1 to the unit element: 

dA = A~ 1 dA = UE~ 1 5UEU~ 1 + UdEU~ x - UdUU~ x , 

where SU = U~ x dU, dE = E~ x dE. The volume spanned by the vectors 
dA must be identical to that spanned by the vectors dA. If we transform the 
vectors dA into S'A by S'A - U~ X SA 17, then the volumes must be preserved. 
We have 

S'A = E~ x dUE - dU + dE. 

The matrix elements of d'A are therefore given by 

(d'A) lk = (SU) lk ^ - 1 

If for U we choose the n(n — 1) parameter, and the additional n parameters 
a v , then from (10.5.14) it follows that 

dm(A) = cFf ( — — 1) dm(A) da. l ■ ■ ■ da. n , (10.5.15) 

iVk \ £ i ) 

where dm(A) only contains the n(n — 1) parameter for U. 

Since ef 1 = e h we obtain 



_ T r ^ k e Z t r % 

k<l k<l e l 

~ n (^k — E d n ( e k ~ s i) = i^i 2 > 

k<l k<l 


j + d ik i da k . (10.5.14) 
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where 


a — rife- £ k) — 

k<l 


„n - 1 C n - 2 

£ 1 e l 

m - 1 n - 2 

£ 2 e 2 


e l 1 

e 2 1 
e„ 1 


For the integration of the class functions, we may therefore use 

c\A\ 2 da,, ■ ■ ■ da„ (10.5.16) 

as the volume element. 


10.6 Representations in Hilbert Space 

A unitary representation of a finite group or a unitary representation of a 
compact group which is at least measurable in a Hilbert space Jif is com¬ 
pletely reducible; its irreducible subspaces are finite dimensional. Since the 
computations for finite groups are identical to those for compact groups 
providing that sums are replaced by integrals, we shall only prove the above 
result for compact groups. 

We introduce the following operators in 

£< v > = | e<;>(a)U(a) dm(a), (10.6.1) 

where 17(a), a e ^ are the unitary representation operators. With the aid of 
(10.5.9) we obtain: 

W = (10.6.2) 

In particular we obtain: 

4W = 8 y „S„E&. (10.6.3) 

It is easy to show that the ££? are self-adjoint, and are therefore pairwise 
orthogonal projection operators. 

For e (v) = ]T t e£? (see §10.5) we therefore find that the operators 

E (v) = £ e^(a)U(a) dm(a ) = £ £<? (10.6.4) 

T T 

are projection operators. 

Since the e£l form a complete orthogonal system in S£ 2 (^, dm), on the 
basis of (10.5.11), for h e ^£ 2 ( C S, dm) it follows that 

h = £ -4 4 ' v l f e£l(a)h(a) dm(a). 

via" 1 ’ J 
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Since / (v) (see §10.5) is spanned by the e^t (v fixed) it follows that 

h (v) = 1-4 f*»(«) e /<V) - (10.6.5) 

Ki 11 J 

From (10.5.9) and from (10.6.5) it follows that 

■ /i (v) = 5 vfl h M . 

Therefore we obtain 

h = Yj g(v) • h - ( 10 . 6 . 6 ) 

V 

For the case in which h(b) = (U(b)f, g) it follows that 

< U(b)f , £/> = X J e (v) (a)(U(a~ i b)f, g) dm(a) 

= x \e (v) (a)(U(b)f,U(a)g) dm(a) 

V J 

= X <U(b)f, E^g). 

V 

Since /, g and U(b ) are arbitrary, it follows that 

X £ <v) = X = 1 (10.6.7) 

V TV 

From (10.6.1) it follows that 

U(b)E ( ;> = je%(a)U(ba) dm(a) 

= J e$(b~ l c)U(c) dm(c) 

= J (be^)(c)U(c) dm(c ) 

= X ei ta( c )U(c) dm(c)g\ v J(b), 

T 

that is, 

U(b)E% = X E™g™(b). (10.6.8) 

T 

The fact that (^)Jf’ is, in the sense of AIV, §14, completely reducible, can 
easily proven using (10.6.8). If S' is an invariant subspace, then there exists 
(using (10.6.7)) for/ e S' a % for which ££?/ # 0. The £’■))/ (v, t fixed) span 
a finite-dimensional irreducible subspace of S'. 

The decomposition of SC in the form AIV (14.7) is obtained from 

Jf?> x SC { ? = E^JC. (10.6.9) 
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We may construct the decomposition of £ (a) ^f in the product form (10.6.9) 
as follows: Choose an / e£ <a bf and a r such that E{ x) f # 0. For h x = 
£*“ , //||£*“ ) /|| we introduce the vectors: 

h pi = EfM- 

From (10.5.8) it follows that ££ )+ = £'“’, and we therefore obtain 

<V> = <*i, = d pa <h u E^h i> = <5 p „, 

since £^h = /i. .Tff 1 can be defined in terms of the basis vectors h pl . We 
obtain 

U(a)h pl = X h al g^ p (a). 

<T 

We may choose a vector h 2 which is orthogonal to all the h pl , and proceed 
in a manner similar to the treatment of h x . We obtain in this way the vectors 
h pv . Therefore there exists a sequence of vectors h pv for which 

U(a)h pv = X h av g ( *l(a). 

a 

We now construct the Hilbert space with a basis u v and fix the following 
isomorphism 

h pv < > h pv u v . 

According to AIV (14.8) the operators U(a ) in have the form 

U(a ) = X £<° t) (L/ (!Z) (a) x 1)£<“>, (10.6.10) 

a 

where the U (x} (a) are operators of the ath irreducible representation. All 
operators which commute with the representation have, according to 
AIV (14.9) the following form: 

B = £ £ (ot) (l x B< a >)£ <a >. (10.6.11) 


10.7 Representations up to a Factor 

For quantum mechanics unitary representations up to a factor are of special 
interest because they correspond to representations in the group ,s/ of M- 
continuous effect isomorphisms (see VI). 

The fact that a unitary representation (up to a factor of a compact group) 
is completely reducible can easily be proven using the procedure given in 
§10.5: If ST is an invariant subspace, then there exists a/ e ST such that not 
all of the 

jx(b)U(b) dm(b)f = 0 
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can be satisfied. For this / the U(a)f, ae8 form an invariant subspace 
£f. In ■/ we define an operator A as follows: 

AU(a)f = U(a ) \x(b)U(b) dm(b)f. 

The operator K = A + A is nonzero. K commutes with all the U(a) and is 
compact. Therefore there exists an invariant finite-dimensional and ir¬ 
reducible subspace in £f. 

We may therefore restrict our considerations to finite-dimensional unitary 
irreducible representations. Such a representation may be given by the U(a). 
Let | 17(a)| be the determinant of U(a). It follows that | U(a) | = e ia (a real). 
By dividing U(a ) by one of the nth roots of e VJ (where n is the dimension of 
the representation) we may choose | U(a) | = 1. In particular, we may choose 
U(e) = 1. If | 17(a)| = 1, then each U(a) is uniquely determined up to one of 
the nth roots of unity. From the continuity of the map VI (3.3.8) we may 
choose a neighborhood V of the unit element such that a -> 17(a) is con¬ 
tinuous. From U(a)U(b ) = co(a, b)U(a, b ) it follows that, for a, b e V, the 
factor co(a, b ) is continuous. For b — e and U(e) = 1 it follows that 
(o(a, e) = 1. From | 17(a) \ | U(b) | = co(a, b)"\U(ab)\ it follows that V can be 
chosen so small that co(a, b) = 1 for all a, b e V. The map a -> U(a) is then 
a local homomorphism of 8. According to §10.4 it follows that this map can 
be extended to a homomorphism, that is, to a representation of the covering 
group 8* of 8. 

Every irreducible unitary representation up to a factor of 8 can therefore, 
by selection of suitable factors for each 17(a) be made into an irreducible 
unitary representation of the covering group 8 *: (a, ( €) -* U(a, If). Con¬ 
versely, every irreducible unitary representation of * produces a represen¬ 
tation of ^ up to a factor as follows: Since we have assumed that the represen¬ 
tation (a, f 4) -> U(a, ( i) of is irreducible, the elements of the fundamental 
group Jf (as a subset of the center of '$*) must be represented by multiples 
of the unit matrix. 

The U(a, 4) which represent the elements of a left coset aA" therefore 
differ only by a factor. If we define a map ^ -* by a -»(a, r 4 k ) where 
(a, <g k ) is an arbitrarily chosen element of the coset a,/V then from a -* 
(a, #*) -> U(a, ( 4 k ) we obtain a representation of 4 up to a factor. 

Given a unitary reducible representation up to a factor of < S, then we may 
reduce the representation as follows: Jf = © t v where the representations 

in 4 V are irreducible. By changing the factors for each U(a) in such a manner 
that the U(a) may generate in a selected t vl a local homomorphic represen¬ 
tation it follows that the U(a), as operators in all of x/f, form a local homo¬ 
morphic representation and therefore form a representation of W* in Jf. 
Since this representation is both a representation of <8* as well as a repre¬ 
sentation up to a factor of 8, the elements of the fundamental group Jf 
must be represented by multiples of the unit operator, that is, the representa¬ 
tion of the fundamental group Jf is equivalent in all the t v . Conversely, by 
combining such irreducible representations of 8* which yield equivalent 





10 Topological Groups 409 


representations of the fundamental group Jf to form a reducible representa¬ 
tion of <$*, then we obtain a representation of ^ up to a factor. 

If we apply the above considerations to the three-dimensional rotation 
group, then from §10.4 it follows that every unitary representation of 2% 
up to a factor corresponds to a representation of the covering group 2% in 
which the element ( e , ( i) where # is the class of paths of C according to 
Figure 52 is represented by either 1 or — 1. Physically this means that a 
system type can have only half integer or integer values of spin angular 
momentum (see, for example, VII, §3). 
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